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Quantum Theories 



Quantum mechanics 



Quantum mechanics, also known as quantum 
physics or quantum theory, is a branch of 
physics providing a mathematical description of 
much of the dual particle-like and wave-like 
behavior and interactions of energy and matter. It 
departs from classical mechanics primarily at the 
atomic and subatomic scales, the so-called 
quantum realm. In advanced topics of quantum 
mechanics, some of these behaviors are 
macroscopic and only emerge at very low or very 
high energies or temperatures. The name, coined 
by Max Planck, derives from the observation that 
some physical quantities can be changed only by 
discrete amounts, or quanta, as multiples of the 
Planck constant, rather than being capable of 
varying continuously or by any arbitrary amount. 
For example, the angular momentum, or more 
generally the action, of an electron bound into an 
atom or molecule is quantized. While an unbound 
electron does not exhibit quantized energy levels, 
an electron bound in an atomic orbital has 
quantized values of angular momentum. In the 
context of quantum mechanics, the wave-particle 
duality of energy and matter and the uncertainty 
principle provide a unified view of the behavior 
of photons, electrons and other atomic- scale 
objects. 
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Fig. 1: Probability densities corresponding to the wavefunctions of an 
electron in a hydrogen atom possessing definite energy levels (increasing 

from the top of the image to the bottom: n = 1, 2, 3, ...) and angular 
momentum (increasing across from left to right: s, p, d, ...). Brighter areas 

correspond to higher probability density in a position measurement. 
Wavefunctions like these are directly comparable to Chladni's figures of 
acoustic modes of vibration in classical physics and are indeed modes of 
oscillation as well: they possess a sharp energy and thus a keen frequency. 
The angular momentum and energy are quantized, and only take on discrete 
values like those shown (as is the case for resonant frequencies in 
acoustics). 



The mathematical formulations of quantum mechanics are abstract. Similarly, the implications are often 
non-intuitive in terms of classic physics. The centerpiece of the mathematical system is the wavefunction. The 
wavefunction is a mathematical function providing information about the probability amplitude of position and 
momentum of a particle. Mathematical manipulations of the wavefunction usually involve the bra-ket notation, 
which requires an understanding of complex numbers and linear functionals. The wavefunction treats the object as a 
quantum harmonic oscillator and the mathematics is akin to that of acoustic resonance. Many of the results of 
quantum mechanics do not have models that are easily visualized in terms of classical mechanics; for instance, the 
ground state in the quantum mechanical model is a non-zero energy state that is the lowest permitted energy state of 
a system, rather than a more traditional system that is thought of as simply being at rest with zero kinetic energy. 

Historically, the earliest versions of quantum mechanics were formulated in the first decade of the 20th century at 
around the same time as the atomic theory and the corpuscular theory of light as updated by Einstein first came to be 
widely accepted as scientific fact; these latter theories can be viewed as quantum theories of matter and 
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electromagnetic radiation. Quantum theory was significantly reformulated in the mid- 1920s away from the old 
quantum theory towards the quantum mechanics formulated by Werner Heisenberg, Max Born, Wolfgang Pauli and 
their associates, accompanied by the acceptance of the Copenhagen interpretation of Niels Bohr. By 1930, quantum 
mechanics had been further unified and formalized by the work of Paul Dirac and John von Neumann, with a greater 
emphasis placed on measurement in quantum mechanics, the statistical nature of our knowledge of reality and 
philosophical speculation about the role of the observer. Quantum mechanics has since branched out into almost 
every aspect of 20th century physics and other disciplines such as quantum chemistry, quantum electronics, quantum 
optics and quantum information science. Much 19th century physics has been re-evaluated as the classical limit of 
quantum mechanics, and its more advanced developments in terms of quantum field theory, string theory, and 
speculative quantum gravity theories. 

History 

The history of quantum mechanics dates back to the 1838 discovery of cathode rays by Michael Faraday. This was 
followed by the 1859 statement of the black body radiation problem by Gustav Kirchhoff, the 1877 suggestion by 
Ludwig Boltzmann that the energy states of a physical system can be discrete, and the 1900 quantum hypothesis of 
Max Planck. tl] Planck's hypothesis that energy is radiated and absorbed in discrete "quanta", or "energy elements", 
enabled the correct derivation of the observed patterns of black body radiation. According to Planck, each energy 
element E is proportional to its frequency v: 

E = hv 

where h is Planck's action constant. Planck cautiously insisted that this was simply an aspect of the processes of 
absorption and emission of radiation and had nothing to do with the physical reality of the radiation itself. 
However, in 1905 Albert Einstein interpreted Planck's quantum hypothesis realistically and used it to explain the 
photoelectric effect, in which shining light on certain materials can eject electrons from the material. Einstein 
postulated that light itself consists of individual quanta of energy, later called photons. 1 J 

The foundations of quantum mechanics were established during the first half of the twentieth century by Niels Bohr, 
Werner Heisenberg, Max Planck, Louis de Broglie, Albert Einstein, Erwin Schrodinger, Max Born, John von 
Neumann, Paul Dirac, Wolfgang Pauli, David Hilbert, and others. In the mid- 1920s, developments in quantum 
mechanics quickly led to its becoming the standard formulation for atomic physics. In the summer of 1925, Bohr and 
Heisenberg published results that closed the "Old Quantum Theory". Out of deference to their dual state as particles, 
light quanta came to be called photons (1926). From Einstein's simple postulation was born a flurry of debating, 
theorizing and testing. Thus, the entire field of quantum physics emerged leading to its wider acceptance at the Fifth 
Solvay Conference in 1927. 

The other exemplar that led to quantum mechanics was the study of electromagnetic waves such as light. When it 
was found in 1900 by Max Planck that the energy of waves could be described as consisting of small packets or 
quanta, Albert Einstein further developed this idea to show that an electromagnetic wave such as light could be 
described by a particle called the photon with a discrete energy dependent on its frequency. This led to a theory of 
unity between subatomic particles and electromagnetic waves called wave-particle duality in which particles and 
waves were neither one nor the other, but had certain properties of both. While quantum mechanics describes the 
world of the very small, it also is needed to explain certain macroscopic quantum systems such as superconductors 
and superfluids. 

The word quantum derives from Latin meaning "how great" or "how much" J 4] In quantum mechanics, it refers to a 
discrete unit that quantum theory assigns to certain physical quantities, such as the energy of an atom at rest (see 
Figure 1). The discovery that particles are discrete packets of energy with wave-like properties led to the branch of 
physics that deals with atomic and subatomic systems which is today called quantum mechanics. It is the underlying 
mathematical framework of many fields of physics and chemistry, including condensed matter physics, solid-state 
physics, atomic physics, molecular physics, computational physics, computational chemistry, quantum chemistry, 
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particle physics, nuclear chemistry, and nuclear physics. Some fundamental aspects of the theory are still actively 
studied.^ Quantum mechanics is essential to understand the behavior of systems at atomic length scales and smaller. 
For example, if classical mechanics governed the workings of an atom, electrons would rapidly travel towards and 
collide with the nucleus, making stable atoms impossible. However, in the natural world the electrons normally 
remain in an uncertain, non-deterministic "smeared" (wave-particle wave function) orbital path around or through 
the nucleus, defying classical electromagnetism. Quantum mechanics was initially developed to provide a better 
explanation of the atom, especially the spectra of light emitted by different atomic species. The quantum theory of 
the atom was developed as an explanation for the electron's staying in its orbital, which could not be explained by 
Newton's laws of motion and by Maxwell's laws of classical electromagnetism. Broadly speaking, quantum 
mechanics incorporates four classes of phenomena for which classical physics cannot account: 

• The quantization (discretization) of certain physical quantities 

• wave-particle duality 

• uncertainty principle 

• quantum entanglement 



Mathematical formulations 

In the mathematically rigorous formulation of quantum mechanics developed by Paul Dirac^ and John von 

rm 

Neumann, 1 the possible states of a quantum mechanical system are represented by unit vectors (called "state 
vectors"). Formally, these reside in a complex separable Hilbert space (variously called the "state space" or the 
"associated Hilbert space" of the system) well defined up to a complex number of norm 1 (the phase factor). In other 
words, the possible states are points in the projectivization of a Hilbert space, usually called the complex projective 
space. The exact nature of this Hilbert space is dependent on the system; for example, the state space for position and 
momentum states is the space of square-integrable functions, while the state space for the spin of a single proton is 
just the product of two complex planes. Each observable is represented by a maximally Hermitian (precisely: by a 
self-adjoint) linear operator acting on the state space. Each eigenstate of an observable corresponds to an eigenvector 
of the operator, and the associated eigenvalue corresponds to the value of the observable in that eigenstate. If the 
operator's spectrum is discrete, the observable can only attain those discrete eigenvalues. 

In the formalism of quantum mechanics, the state of a system at a given time is described by a complex wave 
function, also referred to as state vector in a complex vector space J 10 ^ This abstract mathematical object allows for 
the calculation of probabilities of outcomes of concrete experiments. For example, it allows one to compute the 
probability of finding an electron in a particular region around the nucleus at a particular time. Contrary to classical 
mechanics, one can never make simultaneous predictions of conjugate variables, such as position and momentum, 
with accuracy. For instance, electrons may be considered to be located somewhere within a region of space, but with 
their exact positions being unknown. Contours of constant probability, often referred to as "clouds", may be drawn 
around the nucleus of an atom to conceptualize where the electron might be located with the most probability. 
Heisenberg's uncertainty principle quantifies the inability to precisely locate the particle given its conjugate 
momentum J- U ^ 

As the result of a measurement, the wave function containing the probability information for a system collapses from 
a given initial state to a particular eigenstate of the observable. The possible results of a measurement are the 
eigenvalues of the operator representing the observable — which explains the choice of Hermitian operators, for 
which all the eigenvalues are real. We can find the probability distribution of an observable in a given state by 
computing the spectral decomposition of the corresponding operator. Heisenberg's uncertainty principle is 
represented by the statement that the operators corresponding to certain observables do not commute. 

The probabilistic nature of quantum mechanics thus stems from the act of measurement. This is one of the most 
difficult aspects of quantum systems to understand. It was the central topic in the famous Bohr-Einstein debates, in 
which the two scientists attempted to clarify these fundamental principles by way of thought experiments. In the 
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decades after the formulation of quantum mechanics, the question of what constitutes a "measurement" has been 

extensively studied. Interpretations of quantum mechanics have been formulated to do away with the concept of 

"wavefunction collapse"; see, for example, the relative state interpretation. The basic idea is that when a quantum 

system interacts with a measuring apparatus, their respective wavefunctions become entangled, so that the original 

quantum system ceases to exist as an independent entity. For details, see the article on measurement in quantum 

mechanics. Generally, quantum mechanics does not assign definite values to observables. Instead, it makes 

predictions using probability distributions; that is, the probability of obtaining possible outcomes from measuring an 

ri3i 

observable. Often these results are skewed by many causes, such as dense probability clouds or quantum state 
nuclear attraction J ^ Naturally, these probabilities will depend on the quantum state at the "instant" of the 
measurement. Hence, uncertainty is involved in the value. There are, however, certain states that are associated with 
a definite value of a particular observable. These are known as eigenstates of the observable ("eigen" can be 
translated from German as inherent or as a characteristic).^ 

In the everyday world, it is natural and intuitive to think of everything (every observable) as being in an eigenstate. 

Everything appears to have a definite position, a definite momentum, a definite energy, and a definite time of 

occurrence. However, quantum mechanics does not pinpoint the exact values of a particle for its position and 

momentum (since they are conjugate pairs) or its energy and time (since they too are conjugate pairs); rather, it only 

provides a range of probabilities of where that particle might be given its momentum and momentum probability. 

Therefore, it is helpful to use different words to describe states having uncertain values and states having definite 

values (eigenstate). Usually, a system will not be in an eigenstate of the observable we are interested in. However, if 

one measures the observable, the wavefunction will instantaneously be an eigenstate (or generalized eigenstate) of 

ri7i 

that observable. This process is known as wavefunction collapse, a debatable process. It involves expanding the 

system under study to include the measurement device. If one knows the corresponding wave function at the instant 

before the measurement, one will be able to compute the probability of collapsing into each of the possible 

eigenstates. For example, the free particle in the previous example will usually have a wavefunction that is a wave 

packet centered around some mean position x , neither an eigenstate of position nor of momentum. When one 

ri2i 

measures the position of the particle, it is impossible to predict with certainty the result. It is probable, but not 

certain, that it will be near x , where the amplitude of the wave function is large. After the measurement is 

ri8i 

performed, having obtained some result x, the wave function collapses into a position eigenstate centered at x. 

The time evolution of a quantum state is described by the Schrodinger equation, in which the Hamiltonian, the 

operator corresponding to the total energy of the system, generates time evolution. The time evolution of wave 

functions is deterministic in the sense that, given a wavefunction at an initial time, it makes a definite prediction of 

ri9i 

what the wavefunction will be at any later time. 

During a measurement, on the other hand, the change of the wavefunction into another one is not deterministic, but 
rather unpredictable, i.e., random. A time-evolution simulation can be seen hereJ 20 ^ ^ Wave functions can change 
as time progresses. An equation known as the Schrodinger equation describes how wave functions change in time, a 
role similar to Newton's second law in classical mechanics. The Schrodinger equation, applied to the aforementioned 
example of the free particle, predicts that the center of a wave packet will move through space at a constant velocity, 
like a classical particle with no forces acting on it. However, the wave packet will also spread out as time progresses, 
which means that the position becomes more uncertain. This also has the effect of turning position eigenstates 
(which can be thought of as infinitely sharp wave packets) into broadened wave packets that are no longer position 
eigenstates P 2 ^ 

Some wave functions produce probability distributions that are constant, or independent of time, such as when in a 
stationary state of constant energy, time drops out of the absolute square of the wave function. Many systems that are 
treated dynamically in classical mechanics are described by such "static" wave functions. For example, a single 
electron in an unexcited atom is pictured classically as a particle moving in a circular trajectory around the atomic 
nucleus, whereas in quantum mechanics it is described by a static, spherically symmetric wavefunction surrounding 
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the nucleus (Fig. 1). (Note that only the lowest angular momentum states, labeled s, are spherically symmetric). 

The Schrodinger equation acts on the entire probability amplitude, not merely its absolute value. Whereas the 
absolute value of the probability amplitude encodes information about probabilities, its phase encodes information 
about the interference between quantum states. This gives rise to the wave-like behavior of quantum states. It turns 
out that analytic solutions of Schrodinger's equation are only available for a small number of model Hamiltonians, of 
which the quantum harmonic oscillator, the particle in a box, the hydrogen molecular ion and the hydrogen atom are 
the most important representatives. Even the helium atom, which contains just one more electron than hydrogen, 
defies all attempts at a fully analytic treatment. There exist several techniques for generating approximate solutions. 
For instance, in the method known as perturbation theory one uses the analytic results for a simple quantum 
mechanical model to generate results for a more complicated model related to the simple model by, for example, the 
addition of a weak potential energy. Another method is the "semi-classical equation of motion" approach, which 
applies to systems for which quantum mechanics produces weak deviations from classical behavior. The deviations 
can be calculated based on the classical motion. This approach is important for the field of quantum chaos. 

There are numerous mathematically equivalent formulations of quantum mechanics. One of the oldest and most 
commonly used formulations is the transformation theory proposed by Cambridge theoretical physicist Paul Dirac, 
which unifies and generalizes the two earliest formulations of quantum mechanics, matrix mechanics (invented by 
Werner Heisenberg)^ ^ and wave mechanics (invented by Erwin Schrodinger).^ 26 ^ In this formulation, the 
instantaneous state of a quantum system encodes the probabilities of its measurable properties, or "observables". 

Examples of observables include energy, position, momentum, and angular momentum. Observables can be either 

i"27] 

continuous (e.g., the position of a particle) or discrete (e.g., the energy of an electron bound to a hydrogen atom). 
An alternative formulation of quantum mechanics is Feynman's path integral formulation, in which a 
quantum-mechanical amplitude is considered as a sum over histories between initial and final states; this is the 
quantum-mechanical counterpart of action principles in classical mechanics. 



Interactions with other scientific theories 

The fundamental rules of quantum mechanics are very deep. They assert that the state space of a system is a Hilbert 
space and the observables are Hermitian operators acting on that space, but do not tell us which Hilbert space or 
which operators, or if it even exists. These must be chosen appropriately in order to obtain a quantitative description 
of a quantum system. An important guide for making these choices is the correspondence principle, which states that 
the predictions of quantum mechanics reduce to those of classical physics when a system moves to higher energies 
or equivalently, larger quantum numbers. In other words, classical mechanics is simply a quantum mechanics of 
large systems. This "high energy" limit is known as the classical or correspondence limit. One can therefore start 
from an established classical model of a particular system, and attempt to guess the underlying quantum model that 
gives rise to the classical model in the correspondence limit. 

When quantum mechanics was originally formulated, it was applied to models whose correspondence limit was 
non-relativistic classical mechanics. For instance, the well-known model of the quantum harmonic oscillator uses an 
explicitly non-relativistic expression for the kinetic energy of the oscillator, and is thus a quantum version of the 
classical harmonic oscillator. 

Early attempts to merge quantum mechanics with special relativity involved the replacement of the Schrodinger 
equation with a covariant equation such as the Klein-Gordon equation or the Dirac equation. While these theories 
were successful in explaining many experimental results, they had certain unsatisfactory qualities stemming from 
their neglect of the relativistic creation and annihilation of particles. A fully relativistic quantum theory required the 
development of quantum field theory, which applies quantization to a field rather than a fixed set of particles. The 
first complete quantum field theory, quantum electrodynamics, provides a fully quantum description of the 
electromagnetic interaction. The full apparatus of quantum field theory is often unnecessary for describing 
electrodynamic systems. A simpler approach, one employed since the inception of quantum mechanics, is to treat 
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charged particles as quantum mechanical objects being acted on by a classical electromagnetic field. For example, 
the elementary quantum model of the hydrogen atom describes the electric field of the hydrogen atom using a 
classical - 47r e eQ Coulomb potential. This "semi-classical" approach fails if quantum fluctuations in the 

electromagnetic field play an important role, such as in the emission of photons by charged particles. Quantum field 
theories for the strong nuclear force and the weak nuclear force have been developed. The quantum field theory of 
the strong nuclear force is called quantum chromodynamics, and describes the interactions of the subnuclear 
particles: quarks and gluons. The weak nuclear force and the electromagnetic force were unified, in their quantized 
forms, into a single quantum field theory known as electroweak theory, by the physicists Abdus Salam, Sheldon 
Glashow and Steven Weinberg. These three men shared the Nobel Prize in Physics in 1979 for this work J 28 -' 
It has proven difficult to construct quantum models of gravity, the remaining fundamental force. Semi-classical 
approximations are workable, and have led to predictions such as Hawking radiation. However, the formulation of a 
complete theory of quantum gravity is hindered by apparent incompatibilities between general relativity, the most 
accurate theory of gravity currently known, and some of the fundamental assumptions of quantum theory. The 
resolution of these incompatibilities is an area of active research, and theories such as string theory are among the 
possible candidates for a future theory of quantum gravity. Classical mechanics has been extended into the complex 
domain and complex classical mechanics exhibits behaviours similar to quantum mechanics. 

Quantum mechanics and classical physics 

Predictions of quantum mechanics have been verified experimentally to a very high degree of accuracy. According 
to the correspondence principle between classical and quantum mechanics, all objects obey the laws of quantum 
mechanics, and classical mechanics is just an approximation for large systems (or a statistical quantum mechanics of 
a large collection of particles). The laws of classical mechanics thus follow from the laws of quantum mechanics as a 
statistical average at the limit of large systems or large quantum numbers P 0 ^ However, chaotic systems do not have 
good quantum numbers, and quantum chaos studies the relationship between classical and quantum descriptions in 
these systems. 

Quantum coherence is an essential difference between classical and quantum theories, and is illustrated by the 
Einstein-Podolsky-Rosen paradox. Quantum interference involves the addition of probability amplitudes, whereas 
when classical waves interfere there is an addition of intensities. For microscopic bodies, the extension of the system 
is much smaller than the coherence length, which gives rise to long-range entanglement and other nonlocal 
phenomena characteristic of quantum systems. 1 Quantum coherence is not typically evident at macroscopic scales, 
although an exception to this rule can occur at extremely low temperatures, when quantum behavior can manifest 
itself on more macroscopic scales (see Bose-Einstein condensate). This is in accordance with the following 
observations: 

• Many macroscopic properties of a classical system are a direct consequences of the quantum behavior of its parts. 
For example, the stability of bulk matter (which consists of atoms and molecules which would quickly collapse 
under electric forces alone), the rigidity of solids, and the mechanical, thermal, chemical, optical and magnetic 
properties of matter are all results of the interaction of electric charges under the rules of quantum mechanics. 1 J 

• While the seemingly exotic behavior of matter posited by quantum mechanics and relativity theory become more 
apparent when dealing with extremely fast-moving or extremely tiny particles, the laws of classical Newtonian 
physics remain accurate in predicting the behavior of large objects — of the order of the size of large molecules 
and bigger — at velocities much smaller than the velocity of light. 
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Relativity and quantum mechanics 

Main articles: Quantum gravity and Theory of everything 

Even with the defining postulates of both Einstein's theory of general relativity and quantum theory being 
indisputably supported by rigorous and repeated empirical evidence and while they do not directly contradict each 
other theoretically (at least with regard to primary claims), they are resistant to being incorporated within one 
cohesive model P 4 ^ 

Einstein himself is well known for rejecting some of the claims of quantum mechanics. While clearly contributing to 
the field, he did not accept the more philosophical consequences and interpretations of quantum mechanics, such as 
the lack of deterministic causality and the assertion that a single subatomic particle can occupy numerous areas of 
space at one time. He also was the first to notice some of the apparently exotic consequences of entanglement and 
used them to formulate the Einstein-Podolsky-Rosen paradox, in the hope of showing that quantum mechanics had 
unacceptable implications. This was 1935, but in 1964 it was shown by John Bell (see Bell inequality) that, although 
Einstein was correct in identifying seemingly paradoxical implications of quantum mechanical nonlocality, these 
implications could be experimentally tested. Alain Aspect's initial experiments in 1982, and many subsequent 
experiements since, have verified quantum entanglement. 

According to the paper of J. Bell and the Copenhagen interpretation (the common interpretation of quantum 
mechanics by physicists since 1927), and contrary to Einstein's ideas, quantum mechanics was not at the same time 

• a "realistic" theory 

• and a local theory. 

The Einstein-Podolsky-Rosen paradox shows in any case that there exist experiments by which one can measure the 
state of one particle and instantaneously change the state of its entangled partner, although the two particles can be 
an arbitrary distance apart; however, this effect does not violate causality, since no transfer of information happens. 
Quantum entanglement is at the basis of quantum cryptography, with high-security commercial applications in 
banking and government. 

Gravity is negligible in many areas of particle physics, so that unification between general relativity and quantum 
mechanics is not an urgent issue in those applications. However, the lack of a correct theory of quantum gravity is an 
important issue in cosmology and physicists' search for an elegant "theory of everything". Thus, resolving the 
inconsistencies between both theories has been a major goal of twentieth- and twenty-first-century physics. Many 
prominent physicists, including Stephen Hawking, have labored in the attempt to discover a theory underlying 
everything, combining not only different models of subatomic physics, but also deriving the universe's four 
forces — the strong force, electromagnetism, weak force, and gravity — from a single force or phenomenon. One of 
the leaders in this field is Edward Witten, a theoretical physicist who formulated the groundbreaking M-theory, 
which is an attempt at describing the supersymmetrical based string theory. 

Attempts at a unified field theory 

As of 2010 the quest for unifying the fundamental forces through quantum mechanics is still ongoing. Quantum 

electrodynamics (or "quantum electromagnetism"), which is currently (in the perturbative regime at least) the most 

accurately tested physical theory, has been successfully merged with the weak nuclear force into the electro weak 

force and work is currently being done to merge the electro weak and strong force into the electrostrong force. 

Current predictions state that at around 10 14 GeV the three aforementioned forces are fused into a single unified 

field, ^ Beyond this "grand unification," it is speculated that it may be possible to merge gravity with the other 

19 

three gauge symmetries, expected to occur at roughly 10 GeV. However — and while special relativity is 
parsimoniously incorporated into quantum electrodynamics — the expanded general relativity, currently the best 
theory describing the gravitation force, has not been fully incorporated into quantum theory. 
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Philosophical implications 

Since its inception, the many counter-intuitive results of quantum mechanics have provoked strong philosophical 
debate and many interpretations. Even fundamental issues such as Max Born's basic rules concerning probability 
amplitudes and probability distributions took decades to be appreciated. 

Richard Feynman said, "I think I can safely say that nobody understands quantum mechanics.' 

The Copenhagen interpretation, due largely to the Danish theoretical physicist Niels Bohr, is the interpretation of the 
quantum mechanical formalism most widely accepted amongst physicists. According to it, the probabilistic nature of 
quantum mechanics is not a temporary feature which will eventually be replaced by a deterministic theory, but 
instead must be considered to be a final renunciation of the classical ideal of causality. In this interpretation, it is 
believed that any well-defined application of the quantum mechanical formalism must always make reference to the 
experimental arrangement, due to the complementarity nature of evidence obtained under different experimental 
situations. 

Albert Einstein, himself one of the founders of quantum theory, disliked this loss of determinism in measurement. 
(This dislike is the source of his famous quote, "God does not play dice with the universe.") Einstein held that there 
should be a local hidden variable theory underlying quantum mechanics and that, consequently, the present theory 
was incomplete. He produced a series of objections to the theory, the most famous of which has become known as 
the Einstein-Podolsky-Rosen paradox. John Bell showed that the EPR paradox led to experimentally testable 
differences between quantum mechanics and local realistic theories. Experiments have been performed confirming 
the accuracy of quantum mechanics, thus demonstrating that the physical world cannot be described by local realistic 

T381 

theories. The Bohr-Einstein debates provide a vibrant critique of the Copenhagen Interpretation from an 
epistemological point of view. 

The Everett many-worlds interpretation, formulated in 1956, holds that all the possibilities described by quantum 
theory simultaneously occur in a multiverse composed of mostly independent parallel universes P 9 ^ This is not 
accomplished by introducing some new axiom to quantum mechanics, but on the contrary by removing the axiom of 
the collapse of the wave packet: All the possible consistent states of the measured system and the measuring 
apparatus (including the observer) are present in a real physical (not just formally mathematical, as in other 
interpretations) quantum superposition. Such a superposition of consistent state combinations of different systems is 
called an entangled state. While the multiverse is deterministic, we perceive non-deterministic behavior governed by 
probabilities, because we can observe only the universe, i.e. the consistent state contribution to the mentioned 
superposition, we inhabit. Everett's interpretation is perfectly consistent with John Bell's experiments and makes 
them intuitively understandable. However, according to the theory of quantum decoherence, the parallel universes 
will never be accessible to us. This inaccessibility can be understood as follows: Once a measurement is done, the 
measured system becomes entangled with both the physicist who measured it and a huge number of other particles, 
some of which are photons flying away towards the other end of the universe; in order to prove that the wave 
function did not collapse one would have to bring all these particles back and measure them again, together with the 
system that was measured originally. This is completely impractical, but even if one could theoretically do this, it 
would destroy any evidence that the original measurement took place (including the physicist's memory). 
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Applications 

Quantum mechanics had enormous success in explaining many of the features of our world. The individual 
behaviour of the subatomic particles that make up all forms of matter — electrons, protons, neutrons, photons and 
others — can often only be satisfactorily described using quantum mechanics. Quantum mechanics has strongly 
influenced string theory, a candidate for a theory of everything (see reductionism) and the multi verse hypothesis. 

Quantum mechanics is important for understanding how individual atoms combine covalently to form chemicals or 
molecules. The application of quantum mechanics to chemistry is known as quantum chemistry. (Relativistic) 
quantum mechanics can in principle mathematically describe most of chemistry. Quantum mechanics can provide 
quantitative insight into ionic and covalent bonding processes by explicitly showing which molecules are 
energetically favorable to which others, and by approximately how muchJ 40 ^ Most of the calculations performed in 
computational chemistry rely on quantum mechanics J 4 ^ 

Much of modern technology operates 
at a scale where quantum effects are 
significant. Examples include the laser, 
the transistor (and thus the microchip), 
the electron microscope, and magnetic 
resonance imaging. The study of 
semiconductors led to the invention of 
the diode and the transistor, which are 
indispensable for modern electronics. 

Researchers are currently seeking 
robust methods of directly 
manipulating quantum states. Efforts 
are being made to develop quantum 
cryptography, which will allow 
guaranteed secure transmission of 
information. A more distant goal is the 
development of quantum computers, 
which are expected to perform certain 
computational tasks exponentially 
faster than classical computers. Another active research topic is quantum teleportation, which deals with techniques 
to transmit quantum information over arbitrary distances. 

Quantum tunneling is vital in many devices, even in the simple light switch, as otherwise the electrons in the electric 
current could not penetrate the potential barrier made up of a layer of oxide. Flash memory chips found in USB 
drives use quantum tunneling to erase their memory cells. 

Quantum mechanics primarily applies to the atomic regimes of matter and energy, but some systems exhibit 
quantum mechanical effects on a large scale; superfluidity (the frictionless flow of a liquid at temperatures near 
absolute zero) is one well-known example. Quantum theory also provides accurate descriptions for many previously 
unexplained phenomena such as black body radiation and the stability of electron orbitals. It has also given insight 
into the workings of many different biological systems, including smell receptors and protein structures.^ Recent 
work on photosynthesis has provided evidence that quantum correlations play an essential role in this most 
fundamental process of the plant kingdom J 43 ^ Even so, classical physics often can be a good approximation to 
results otherwise obtained by quantum physics, typically in circumstances with large numbers of particles or large 
quantum numbers. (However, some open questions remain in the field of quantum chaos.) 
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A working mechanism of a resonant tunneling diode device, based on the phenomenon of 
quantum tunneling through the potential barriers. 
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Examples 



Particle in a box 



The particle in a 1 -dimensional potential energy box is the most simple 
example where restraints lead to the quantization of energy levels. The 
box is defined as having zero potential energy inside a certain region 
and infinite potential energy everywhere outside that region. For the 
1 -dimensional case in the x direction, the time-independent 
Schrodinger equation can be written as:'- 44 -' 




o L x 

1 -dimensional potential energy box (or infinite 
potential well) 



- Eip. 



2m dx 2 

Writing the differential operator 

•* d 

Px = -in— 
dx 

the previous equation can be seen to be evocative of the classic analogue 
— f = E 

with E as me energy for the state if) , in this case coinciding with the kinetic energy of the particle. 
The general solutions of the Schrodinger equation for the particle in a box are: 

h 2 k 2 

i>tx) = Ae ikx + Be~ ikx E = — - 

2m 

or, from Euler's formula, 

ip(x) = C sin kx + D cos kx. 

The presence of the walls of the box determines the values of C, D, and k. At each wall (x = 0 and x = L), xp = 0. 
Thus when x - 0, 

^(0) = 0 = Csin0 + Dcos0 = D 

and so D = 0. When x - L, 

ijj(L) = 0 = CsinfcL. 

C cannot be zero, since this would conflict with the Born interpretation. Therefore sin kL = 0, and so it must be that 
kL is an integer multiple of it. Therefore, 

k = n = 1, 2, 3, ... . 

L 

The quantization of energy levels follows from this constraint on k, since 
n 7r n n h 



E 



2mL 2 8mL 2 ' 
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Free particle 

For example, consider a free particle. 
In quantum mechanics, there is 
wave-particle duality so the properties 
of the particle can be described as the 
properties of a wave. Therefore, its 
quantum state can be represented as a 
wave of arbitrary shape and extending 
over space as a wave function. The 
position and momentum of the particle 
are observables. The Uncertainty 
Principle states that both the position 
and the momentum cannot 
simultaneously be measured with full 
precision at the same time. However, 
one can measure the position alone of a 
moving free particle creating an eigenstate of position with a wavefunction that is very large (a Dirac delta) at a 
particular position x and zero everywhere else. If one performs a position measurement on such a wavefunction, the 
result x will be obtained with 100% probability (full certainty). This is called an eigenstate of position 
(mathematically more precise: a generalized position eigenstate (eigendistribution)). If the particle is in an eigenstate 
of position then its momentum is completely unknown. On the other hand, if the particle is in an eigenstate of 
momentum then its position is completely unknown J 45 ^ In an eigenstate of momentum having a plane wave form, it 
can be shown that the wavelength is equal to h/p, where h is Planck's constant and p is the momentum of the 

* * [46] 

eigenstate. 
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Schrodinger equation 

In physics, specifically quantum mechanics, the Schrodinger equation, formulated in 1926 by Austrian physicist 
Erwin Schrodinger, is an equation that describes how the quantum state of a physical system changes in time. It is as 
central to quantum mechanics as Newton's laws are to classical mechanics. 

dt 

Two forms of the Schrodinger equation 

In the standard interpretation of quantum mechanics, the quantum state, also called a wavefunction or state vector, is 

the most complete description that can be given to a physical system. Solutions to Schrodinger's equation describe 

not only molecular, atomic and subatomic systems, but also macroscopic systems, possibly even the whole universe. 
[1] 

The most general form is the time-dependent Schrodinger equation, which gives a description of a system evolving 
with time. For systems in a stationary state, the time-independent Schrodinger equation is sufficient. Approximate 
solutions to the time-independent Schrodinger equation are commonly used to calculate the energy levels and other 
properties of atoms and molecules. 

Schrodinger's equation can be mathematically transformed into Werner Heisenberg's matrix mechanics, and into 
Richard Feynman's path integral formulation. The Schrodinger equation describes time in a way that is inconvenient 
for relativistic theories, a problem which is not as severe in matrix mechanics and completely absent in the path 
integral formulation. 
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The Schrodinger equation 

The Schrodinger equation takes several different forms, depending on the physical situation. This section presents 
the equation for the general case and for the simple case encountered in many textbooks. 

General quantum system 

For a general quantum system: ^ 

d 

ih—^ = 

at 

where 

• \Jris the wave function; the probability amplitude for different configurations of the system at different times, 

d 

• ifi — is the energy operator ( i is the imaginary unit and ft is the reduced Planck constant), 

dt 

• is the Hamiltonian operator. 

Single particle in a potential 

For a single particle with potential energy V in position space, the Schrodinger equation takes the form. 

•ft^*(r, f) = H9 = (-|^V 2 + V{r)j tt(r, t) = -^V 2 *<r, t)+V(r)*(r, t) 

where 

h 2 

• V 2 ^ s me kinetic energy operator, where m is the mass of the particle. 

2m 

Q2 Q2 Q2 

• V 2 * s me Laplace operator. In three dimensions, the Laplace operator is ^ -| ^ -| ^, where x, y, 

dx 2 dy 2 dz l 

and z are the Cartesian coordinates of space. 

• V (r) is the time-independent potential energy at the position r. 

• ^(r, t) is the probability amplitude for the particle to be found at position r at time t. 

• H = ^— V 2 + V(r) i s me Hamiltonian operator for a single particle in a potential. 

Time independent or stationary equation 

The time independent equation, again for a single particle with potential energy V takes the form:'- 4 -' 

h 2 

Eif>(r) = -— Vfy(r) + V(r)if>(r). 
2m 

This equation describes the standing wave solutions of the time-dependent equation, which are the states with 
definite energy. 
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Historical background and development 

Following Max Planck's quantization of light (see black body radiation), Albert Einstein interpreted Planck's 
quantum to be photons, particles of light, and proposed that the energy of a photon is proportional to its frequency, 
one of the first signs of wave-particle duality. Since energy and momentum are related in the same way as frequency 
and wavenumber in special relativity, it followed that the momentum p of a photon is proportional to its wavenumber 
k. 

h fit 

p — — — nk 

A 

Louis de Broglie hypothesized that this is true for all particles, even particles such as electrons. Assuming that the 
waves travel roughly along classical paths, he showed that they form standing waves for certain discrete frequencies. 
These correspond to discrete energy levels, which reproduced the old quantum condition 

Following up on these ideas, Schrodinger decided to find a proper wave equation for the electron. He was guided by 
William R. Hamilton's analogy between mechanics and optics, encoded in the observation that the zero-wavelength 
limit of optics resembles a mechanical system — the trajectories of light rays become sharp tracks which obey 
Fermat's principle, an analog of the principle of least action J 6 ^ A modern version of his reasoning is reproduced in 
the next section. The equation he found is: 

d h 2 
ih^(x, t) = -^V 2 *(x, t) + V(x)*(x, t). 

Using this equation, Schrodinger computed the hydrogen spectral series by treating a hydrogen atom's electron as a 
wave W(x, t), moving in a potential well V, created by the proton. This computation accurately reproduced the energy 
levels of the Bohr model. 

However, by that time, Arnold Sommerfeld had refined the Bohr model with relativistic corrections ^ 
Schrodinger used the relativistic energy momentum relation to find what is now known as the Klein-Gordon 
equation in a Coulomb potential (in natural units): 

He found the standing waves of this relativistic equation, but the relativistic corrections disagreed with Sommerfeld's 
formula. Discouraged, he put away his calculations and secluded himself in an isolated mountain cabin with a 
lover. 1 

While at the cabin, Schrodinger decided that his earlier non-relativistic calculations were novel enough to publish, 
and decided to leave off the problem of relativistic corrections for the future. He put together his wave equation and 
the spectral analysis of hydrogen in a paper in 1926.^ 10 ^ The paper was enthusiastically endorsed by Einstein, who 
saw the matter-waves as an intuitive depiction of nature, as opposed to Heisenberg's matrix mechanics, which he 
considered overly formal.^ 1 ^ 

The Schrodinger equation details the behaviour of ip but says nothing of its nature. Schrodinger tried to interpret it as 

ri2i 

a charge density in his fourth paper, but he was unsuccessful. In 1926, just a few days after Schrodinger' s fourth 

ri3i 

and final paper was published, Max Born successfully interpreted xp as a probability amplitude. Schrodinger, 
though, always opposed a statistical or probabilistic approach, with its associated discontinuities — much like 
Einstein, who believed that quantum mechanics was a statistical approximation to an underlying deterministic 
theory — and never reconciled with the Copenhagen interpretation J- 14] 
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Derivation 

Short heuristic derivation 

Schrodinger's equation can be derived in the following short heuristic way. 
Assumptions 

1 . The total energy E of a particle is 

E = T + V=^- + V. 

This is the classical expression for a particle with mass m where the total energy E is the sum of the kinetic 
energy T, and the potential energy V (which can vary with position, and time), p and m are respectively the 
momentum and the mass of the particle. 

2. Einstein's light quanta hypothesis of 1905, which asserts that the energy E of a photon is proportional to the 
frequency v (or angular frequency, co = 2jtv) of the corresponding electromagnetic wave: 

E — hi/ — Tiuj , 

3. The de Broglie hypothesis of 1924, which states that any particle can be associated with a wave, and that the 
momentum p of the particle is related to the wavelength A (or wavenumber k) of such a wave by: 

p = X = hk 5 

Expressing p and k as vectors, we have 

p = Kk . 

4. The three assumptions above allow one to derive the equation for plane waves only. To conclude that it is true in 
general requires the superposition principle, and thus, one must separately postulate that the Schrodinger equation 
is linear. 

Expressing the wave function as a complex plane wave 

Schrodinger's idea was to express the phase of a plane wave as a complex phase factor: 

and to realize that since 
d 

— * = -im^ 
dt 

then 

d 

= Tiu)^ = ih—^ 
at 

and similarly since 

d 
OX 

and 

d 2 



dx 2 

we find: 

so that, again for a plane wave, he obtained: 
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And, by inserting these expressions for the energy and momentum into the classical formula we started with, we get 
Schrodinger's famed equation, for a single particle in the 3-dimensional case in the presence of a potential V: 

d h 2 
at 2m 

Versions 

There are several equations that go by Schrodinger's name: 
Time dependent equation 

This is the equation of motion for the quantum state. In the most general form, it is written: ^ 15 ^ 

ih^(x, t) = HV(x, t) 

where jj is a linear operator acting on the wavefunction For the specific case of a single particle in one 
dimension moving under the influence of a potential vJ 15 ^ 

t) = _ A- |!Ltt(x, t) + V(x)*(x, t) 

and the operator jj can be read off: 

For a particle in three dimensions, the only difference is more derivatives: 

d h 2 
ih— y, z, t) = -— V 2 *(x 5 y, z, t) + V(x, y, z)9(x, y, z, t) 

and for N particles, the difference is that the wavefunction is in 3,/V-dimensional configuration space, the space of all 
possible particle positions J 16 ^ 

This last equation is in a very high dimension, so that the solutions are not easy to visualize. 
Time independent equation 

This is the equation for the standing waves, the eigenvalue equation for . In abstract form, for a general quantum 
system, it is written: ^ 15 ^ 

= Eip. 

For a particle in one dimension, 

2 

But there is a further restriction — the solution must not grow at infinity, so that it has either a finite L -norm (if it is 

ri7i 

a bound state) or a slowly diverging norm (if it is part of a continuum). 
' 2 = / Mx)\ 2 dx. 



For example, when there is no potential, the equation reads:^ 18] 

-Eib = ir 

* 2m dx 2 
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which has oscillatory solutions for E > 0 (the are arbitrary constants): 

and exponential solutions for E < 0 

1>-\B\(x) = C ie V 2 ™|£|/& 2 * + C2e -sfi^EUtf* 
The exponentially growing solutions have an infinite norm, and are not physical. They are not allowed in a finite 
volume with periodic or fixed boundary conditions. 

For a constant potential V the solution is oscillatory for E > V and exponential for E < V, corresponding to energies 
which are allowed or disallowed in classical mechanics. Oscillatory solutions have a classically allowed energy and 
correspond to actual classical motions, while the exponential solutions have a disallowed energy and describe a small 
amount of quantum bleeding into the classically disallowed region, to quantum tunneling. If the potential V grows at 
infinity, the motion is classically confined to a finite region, which means that in quantum mechanics every solution 
becomes an exponential far enough away. The condition that the exponential is decreasing restricts the energy levels 
to a discrete set, called the allowed energies. 

Nonlinear equation 

ri9i 

The nonlinear Schrodinger equation is the partial differential equation (in dimensionless form) 

id t if; = - ^ + 
for the complex field ip(xj). 

ri9i 

This equation arises from the Hamiltonian 1 

H=jdx [^i 2 +^r 

with the Poisson brackets 

{^(x)^( y )} = {r(x) : r(y)} = o 

{ip*(x),ifj(y)} = iS(x-y). 
It must be noted that this is a classical field equation. Unlike its linear counterpart, it never describes the time 
evolution of a quantum state. 

Properties 

The Schrodinger equation has certain properties. 

Local conservation of probability 

The probability density of a particle is ^*(x 5 t) • The probability flux is defined as [in units of 

(probability )/(area x time)]: 

j = (**V* - = — Im (tf*V* ) . 

The probability flux satisfies the continuity equation: 

^P(x,i)+V-j = 0 

where P(z; 5 t) is the probability density [measured in units of (probability )/( volume)]. This equation is the 

mathematical equivalent of the probability conservation law. 
For a plane wave: 

(re, t) = Ae* kx -^ 
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j(x,t) = \Af™. 

m 

So that not only is the probability of finding the particle the same everywhere, but the probability flux is as expected 
from an object moving at the classical velocity p/m. The reason that the Schrodinger equation admits a probability 
flux is because all the hopping is local and forward in time. 

Relativity 

The Schrodinger equation does not take into account relativistic effects; as a wave equation, it is invariant under a 
Galilean transformation, but not under a Lorentz transformation. But in order to include relativity, the physical 
picture must be altered. 

The Klein-Gordon equation uses the relativistic mass-energy relation: 

E 2 =p 2 c 2 +m 2 c l 
to produce the differential equation: 

which is relativistically invariant. 

Solutions 

Some general techniques are: 

• Perturbation theory 

• The variational method 

• Quantum Monte Carlo methods 

• Density functional theory 

• The WKB approximation and semi-classical expansion 
In some special cases, special methods can be used: 

• List of quantum-mechanical systems with analytical solutions 

• Hartree-Fock method and post Hartree-Fock methods 

• Discrete delta-potential method 
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Dirac equation 

The Dirac equation is a relativistic quantum mechanical wave equation formulated by British physicist Paul Dirac 
in 1928. It provides a description of elementary spin-Yz particles, such as electrons, consistent with both the 
principles of quantum mechanics and the theory of special relativity. The equation demands the existence of 
antiparticles and actually predated their experimental discovery. This made the discovery of the positron, the 
antiparticle of the electron, one of the greatest triumphs of modern theoretical physics. 



Mathematical formulation 

The Dirac equation in the form originally proposed by Dirac is: 



3 



(3mc + 2^ ot-kPk c j *) = in — ^ — 



where 

m is the rest mass of the electron, 

c is the speed of light, 

p is the momentum operator, 

x and t are the space and time coordinates, 

h = hllit is the reduced Planck constant, also known as Dirac's constant. 
The new elements in this equation are the 4x4 matrices <2fc and (3 , and the four-component wavefunction %j) . The 
matrices are all Hermitian and have squares equal to the identity matrix: 

<*\ = P = h 

and they all mutually anticommute: {g^ 5 ^j} = Oand |a^ ; /?} = 0 . Explicitly, 
a { {3 = -j3a h 

where i and j are distinct and range from 1 to 3. These matrices, and the form of the wavefunction, have a deep 
mathematical significance. The algebraic structure represented by the Dirac matrices had been created some 50 years 
earlier by the English mathematician W. K. Clifford. In turn, Clifford's formulation had been based on the mid- 19th 
century work of the German mathematician Hermann Grassmann in his "Lineare Ausdehnungslehre" (Theory of 
Linear Extensions). The latter had been regarded as well-nigh incomprehensible by most of his contemporaries. The 
appearance of something so seemingly abstract, at such a late date, in such a direct physical manner, is one of the 
most remarkable chapters in the history of physics. 

The commutation rules are designed so that a solution of Dirac's equation will automatically also be a solution of 
((mc 2 ) 2 + ^( Pfe c) 2 )V = £;V 

k = l 

which is the relativistic energy-momentum equation. 
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Comparison with the Schrodinger equation 

The Dirac equation is superficially similar to the Schrodinger equation for a free particle: 

The left side represents the square of the momentum operator divided by twice the mass, which is the nonrelativistic 
kinetic energy. A relativistic generalization of this equation requires that space and time derivatives must enter 
symmetrically, as they do in the relativistic Maxwell equations — the derivatives must be of the same order in space 
and time. In relativity, the momentum and the energy are the space and time parts of a geometrical space-time 
vector, the 4-momentum, and they are related by the relativistically invariant relation 

7712 

& 2 2 2 

— - p =m c 

which says that the length of this vector is the rest mass m. Replacing E and p by {fi — and —ifiS? as Schrodinger 

dt 

theory requires, we get a relativistic equation: 

and the wave function 0 is a relativistic scalar: a complex number which has the same numerical value in all 
frames. Because the equation is second order in the time derivative, one must specify the initial values of not only 
0 , but also of dt<j) . This is normal for classical waves, where the initial conditions are the position and velocity. 
However, in quantum mechanics, the wavefunction is supposed to be the complete description. That is, just knowing 
the wavefunction should determine the future. 

In the Schrodinger theory, the probability density is given by the positive definite expression 

P = </>*</> 

and its current by 

ih 

J = -—(<f>*VJ>-<i>V<!>*) 

and the conservation of probability density has a local form: 

In a relativistic theory, the form of the probability density and the current must form a four vector, so the form of the 
probability density can be found from the current just by replacing \7 by d t 

ih 

p=—(d>*d t d>-<i>d t <j)*). 

Everything is relativistic now, but the probability density is not positive definite, because the initial values of both (j) 
and dt(j) can be freely chosen. This expression reduces to Schrodinger' s density and current for superpositions of 
positive frequency waves whose wavelength is long compared to the Compton wavelength, that is, for nonrelativistic 
motions. It reduces to a negative definite quantity for superpositions of negative frequency waves only. It mixes up 
both signs when forces which have an appreciable amplitude to produce relativistic motions are involved, at which 
point scattering can produce particles and antiparticles. 

Although it was not a successful description of a single particle, this equation is resurrected in quantum field theory, 
where it is known as the Klein-Gordon equation, and describes a relativistic spin-0 complex field. The non-positive 
probability density and current are the charge-density and current, while the particles are described by a 
mode-expansion. 

To interpret the Klein-Gordon equation as an equation for the probability amplitude for a single particle at a given 
position, negative frequency solutions must be interpreted as describing the particle travelling backwards in time, 
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propagating into the past. 

The equation with this interpretation does not predict the future from the present except in the nonrelativistic limit, 
rather it places a global constraint on the amplitudes. This can be used to construct a perturbation expansion with 
particles zipping backwards and forwards in time, the Feynman diagrams, but it does not allow a straightforward 
wavefunction description, since each particle has its own separate proper time. Ultimately, the entity the 
Klein-Gordon equation (and Dirac equation) acts on is a field, not a wavefunction. The fields are physical fields 
whose values are observables. They are not probability amplitudes. 

Dirac's coup 

Thinking in terms of wavefunctions rather than fields, Dirac reasoned that the necessary equation is first-order in 
both space and time. One could formally take the relativistic expression for the energy E = C\Jp 2 + m 2 c 2 i 

replace p by its operator equivalent, expand the square root in an infinite series of derivative operators, set up an 
eigenvalue problem, then solve the equation formally by iterations. Most physicists had little faith in such a process, 
even if it were technically possible. 

As the story goes, Dirac was staring into the fireplace at Cambridge, pondering this problem, when he hit upon the 
idea of taking the square root of the wave operator thus: 
1 ^2 

V ' - = ( Ad * + Bd v + Cd * + -Dd t ){Ad x + Bd y + Cd z + -Ddt). 

c z ot z c c 

On multiplying out the right side we see that, to get all the cross-terms such as d x d y to vanish, we must assume 
AB + BA = 0, ... 

with 

A 2 = B 2 = . . . = 1. 

Dirac, who had just then been intensely involved with working out the foundations of Heisenberg's matrix 
mechanics, immediately understood that these conditions could be met if A, B... are matrices, with the implication 
that the wave function has multiple components. This immediately explained the appearance of two-component wave 
functions in Pauli's phenomenological theory of spin, something that up until then had been regarded as mysterious, 
even to Pauli himself. However, one needs at least 4x4 matrices to set up a system with the properties desired — so 
the wave function had four components, not two, as in the Pauli theory. 

Given the factorization in terms of these matrices, one can now write down immediately an equation 

(Ad x + Bd y + Cd z + -Dd t )^ = Kip 

c 

with k to be determined. Applying again the matrix operator on either side yields 

{v 2 - = *v 

On taking k = mc/Ti we find that all the components of the wave function individually satisfy the relativistic 
energy-momentum relation. Thus the sought-for equation that is first-order in both space and time is 

1 TfiC 

(Ad x + Bd y + cd z + -Dd t - -r)ip = o. 

c h 

With (j4 ? 5, C) = i/3ak and D — (3 , we get the Dirac equation. 
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Comparison with the Pauli theory 

The necessity of introducing half-integral spin goes back experimentally to the results of the Stern-Gerlach 
experiment. A beam of atoms is run through a strong inhomogeneous magnetic field, which then splits into N parts 
depending on the intrinsic angular momentum of the atoms. It was found that for silver atoms, the beam was split in 
two — the ground state therefore could not be integral, because even if the intrinsic angular momentum of the atoms 

were as small as possible, 1, the beam would be split into 3 parts, corresponding to atoms with L =-1,0, and +1. 

l z 

The conclusion is that silver atoms have net intrinsic angular momentum of Pauli set up a theory which explained 
this splitting by introducing a two-component wave function and a corresponding correction term in the 
Hamiltonian, representing a semi-classical coupling of this wave function to an applied magnetic field, as so: 

2 



Here A** is the applied electromagnetic field, and the three sigmas are Pauli matrices, eis the charge of the 
particle, e.g. e = — eofor the electron. On squaring out the first term, a residual interaction with the magnetic field 
is found, along with the usual Hamiltonian of a charged particle interacting with an applied field: 

H = — [p- -A) + eA 0 - 



2m V c ) 2mc 
This Hamiltonian is now a 2 x 2 matrix, so the Schrodinger equation based on it, 

dd 

H<p = ih-£ 

must use a two-component wave function. Pauli had introduced the sigma matrices 




0 1\ (0 -i 
i 0 




as pure phenomenology — Dirac now had a theoretical argument that implied that spin was somehow the 
consequence of the marriage of quantum theory to relativity. 

The Pauli matrices share the same properties as the Dirac matrices — they are all Hermitian, square to 1, and 
anticommute. This allows one to immediately find a representation of the Dirac matrices in terms of the Pauli 
matrices: 



(0 <Tk\ 
' \<Tk 0 J 



The Dirac equation now may be written as an equation coupling two-component spinors: 



/ mc 2 ca-p\f <(> + \ _ d_ ( 0+\ 
\ca ■ p -m<?) \<t>_) ~ m dt \4>-J ■ 





Notice that on the diagonal we find the rest energy of the particle. If we set the momentum to zero — that is, bring the 
particle to rest — then we have 

( mc 2 

= V 0 

The equations for the individual two-spinors are now decoupled, and we see that the "top" and "bottom" two-spinors 
are individually eigenf unctions of the energy with eigenvalues equal to plus and minus the rest energy, respectively. 
The appearance of this negative energy eigenvalue is completely consistent with relativity. 

It should be strongly emphasized that this separation in the rest frame is not an invariant statement — the "bottom" 
two-spinor does not represent antimatter as such in general. The entire four-component spinor represents an 
irreducible whole — in general, states will have an admixture of positive and negative energy components. If we 
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couple the Dirac equation to an electromagnetic field, as in the Pauli theory, then the positive and negative energy 
parts will be mixed together, even if they are originally decoupled. Dirac's main problem was to find a consistent 
interpretation of this mixing. As we shall see below, it brings a new phenomenon into physics — matter/antimatter 
creation and annihilation. 



Covariant form and relativistic invariance 

The explicitly covariant form of the Dirac equation is (employing the Einstein summation convention): 

—thrf'd^ + mcij) — 0. 
In the above, 7^ are the Dirac matrices. -y°is Hermitian, and the j k are anti-Hermitian, with the definition 

7°=/? 



7 = 7 a . 



Thus 



7 = 



/ 1 0 

0 1 

0 0 

\ 0 0 



0 
0 

-1 

0 



\ 



0 
0 
0 

- 1 / 



,7 = 



/ 0 

0 
0 

V-i 



0 
0 

-1 
0 



0 
1 
0 
0 



0 
0 

0/ 



,7 2 = 



/ 0 
0 
0 



0 0 

0 i 

1 0 
0 0 



0 
0 
0 



/ 



,7 = 



\ 



0 
0 

-1 

0 



This may be summarized using the Minkowski metric on spacetime in the form 

where the bracket expression {a, b} means ab + ba , the anticommutator. These are the defining relations of a 

Clifford algebra over a pseudo-orthogonal 4-d space with metric signature (-| ) . (Note that one may also 

employ the metric form ( 1- ++) by multiplying all the gammas by a factor of { .) The specific Clifford algebra 

employed in the Dirac equation is known as the Dirac algebra. 

The Dirac equation may be interpreted as an eigenvalue expression, where the rest mass is proportional to an 
eigenvalue of the 4-momentum operator, the proportion being the speed of light in vacuo: 

In practice, physicists often use units of measure such that ft and c are equal to 1, known as "natural" units. The 
equation then takes the simple form 

(-ij fA d fl + m)i/; = 0 
or, if Feynman slash notation is employed, 

(-i$ + m)ip = 0. 

A fundamental theorem states that if two distinct sets of matrices are given that both satisfy the Clifford relations, 
then they are connected to each other by a similarity transformation: 

y = s~ l ^s. 

If in addition the matrices are all unitary, as are the Dirac set, then S itself is unitary; 

y = ifl<fu. 

The transformation U is unique up to a multiplicative factor of absolute value 1. Let us now imagine a Lorentz 
transformation to have been performed on the derivative operators, which form a covariant vector. For the operator 
7^ dp to remain invariant, the gammas must transform among themselves as a contravariant vector with respect to 
their spacetime index. These new gammas will themselves satisfy the Clifford relations, because of the orthogonality 
of the Lorentz transformation. By the fundamental theorem, we may replace the new set by the old set subject to a 
unitary transformation. In the new frame, remembering that the rest mass is a relativistic scalar, the Dirac equation 
will then take the form 



0 1 

0 0 

0 0 

1 0 
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(-irf'fU&p + ™)ip(x', t') = 0 
U^-i-ffy + m)U^(x',t') = 0. 

If we now define the transformed spinor 

if/ = Uip 

then we have the transformed Dirac equation 

H 7 ^ + m)VV, t f ) = 0. 
Thus, once we settle on a unitary representation of the gammas, it is final provided we transform the spinor 
according the unitary transformation that corresponds to the given Lorentz transformation. 

These considerations reveal the origin of the gammas in geometry, hearkening back to Grassmann's original 
motivation - they represent a fixed basis of unit vectors in spacetime. Similarly, products of the gammas such as 
7m T^represent oriented surface elements, and so on. With this in mind, we can find the form the unit volume 
element on spacetime in terms of the gammas as follows. By definition, it is 

v = ^wtVtV • 

For this to be an invariant, the epsilon symbol must be a tensor, and so must contain a factor of yjg , where g is the 
determinant of the metric tensor. Since this is negative, that factor is imaginary. Thus 
V = ij 7 7 7 . 

This matrix is given the special symbol 75, owing to its importance when one is considering improper 
transformations of spacetime, that is, those that change the orientation of the basis vectors. In the representation we 
are using for the gammas, it is 

75 = (/ 2 o 2 )- 

Also note that we could as easily have taken the negative square root of the determinant of g - the choice amounts to 
an initial handedness convention. 

Lorentz Invariance of the Dirac equation 

The Lorentz invariance of the Dirac equation follows from its co variant nature. 

Comparison with the Klein-Gordon equation 

Using the Feynman slash notation, the Klein-Gordon equation can be factored: 

0 = (d 2 + m 2 )^ = (f + m 2 )ip = (i$ + m)(-i$ + m)^ . 
The last factor is simply the Dirac equation. Hence any solution to the Dirac equation is automatically a solution to 
the Klein-Gordon equation: 

+ m)iP = 0^(d 2 + m 2 )V> = 0 . 
But the converse is not true; not all solutions to the Klein-Gordon equation solve the Dirac equation. 
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Adjoint equation and Dirac current 

By defining the adjoint spinor 

where ^ is the conjugate transpose of ip , and noticing that 

{7 M)t 7 o = yy, 

we obtain, by taking the Hermitian conjugate of the Dirac equation and multiplying from the right by -y 0 , the 
adjoint equation: 

j>{i-fdp + m) = 0 

where is understood to act to the left. Multiplying the Dirac equation by ^ from the left, and the adjoint 
equation by ij) from the right, and adding, produces the law of conservation of the Dirac current in covariant form: 
dp (f^ip) = 0. 

Now we see the great advantage of the first-order equation over the one Schrodinger had tried - this is the conserved 
current density required by relativistic in variance, only now its 0-component is positive definite: 

The Dirac equation and its adjoint are the Euler-Lagrange equations of the 4-d invariant action integral 
S = J Ld\ 

where d A x = dt dx dy dz , and the scalar L is the Dirac Lagrangian 

ih - 

L = m*H> - -(VYX^VO - (d^W) 
and for the purposes of variation, if) and are regarded as independent fields. The relativistic invariance also 
follows immediately from the variational principle. 

Coupling to an electromagnetic field 

To consider problems in which an applied electromagnetic field interacts with the particles described by the Dirac 
equation, one uses the correspondence principle, and takes over into the theory the corresponding expression from 
classical mechanics, whereby the total momentum of a charged particle in an external field is modified as so: 

P — > p — -A , 
c 

(where q is the charge of the particle; for example, q = — e < Ofor an electron). In natural units, the Dirac 
equation then takes the form 

+ iqAp) + m}^ = 0. 

The validity of this prescription has been confirmed experimentally with great precision. It is known as minimal 
coupling, and is found throughout particle physics. Indeed, while the introduction of the electromagnetic field in this 
way is essentially phenomenological in this context, it arises from a fundamental principle in quantum field theory. 

Now as stated above, the transformation U is defined only up to a phase factor e i0 . Also, the fundamental 
observable of the Dirac theory, the current, is unchanged if we multiply the field (which is not a wavefunction) by an 
arbitrary phase. Because the field is not a wavefunction, this phase invariance has a different physical meaning from 
the phase invariance of probability amplitudes. We may exploit this to get the form of the mutual interaction of a 
Dirac particle and the electromagnetic field, as opposed to simply considering a Dirac particle in an applied field, by 
assuming this arbitrary phase factor to depend continuously on position: 

Notice now that 
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To preserve minimal coupling, we must add to the potential a term proportional to the gradient of the phase. But we 
know from electrodynamics that this leaves the electromagnetic field itself invariant. The value of the phase is 
arbitrary, but not how it changes from place to place. This is the starting point of gauge theory, which is the main 
principle on which quantum field theory is based. The simplest such theory, and the one most thoroughly 
understood, is known as quantum electrodynamics. The equations of field theory thus have invariance under both 
Lorentz transformations and gauge transformations. 

Curved spacetime Dirac equation 

The Dirac equation can be written in curved spacetime using vierbein fields. Vierbeins describe a local frame that 
enables to define Dirac matrices at every point. Contracting these matrices with vierbeins give the right 
transformation properties. This way Dirac's equation takes the following form in curved spacetime ^ : 

-vfe^D^ + m^f = 0. 
Here e^is the vierbein and D^is the covariant derivative for fermion fields, defined as follows 

where ^]ac is the Lorentzian metric, a ab is the commutator of Dirac matrices: 



ab 1 a b 

and is the spin connection: 

where is the Christoffel symbol. Note that here, Latin letters denote the "Lorentzian" indices and Greek ones 
denote "Riemannian" indices. 

Physical interpretation 

The Dirac theory, while providing a wealth of information that is accurately confirmed by experiments, nevertheless 
introduces a new physical paradigm that appears at first difficult to interpret and even paradoxical. Some of these 
issues of interpretation must be regarded as open questions. Here we will see how the Dirac theory brilliantly 
answered some of the outstanding issues in physics at the time it was put forward, while posing others that are still 
the subject of debate. 

Identification of observables 

The critical physical question in a quantum theory is - what are the physically observable quantities defined by the 
theory? According to general principles, such quantities are defined by Hermitian operators that act on the Hilbert 
space of possible states of a system. The eigenvalues of these operators are then the possible results of measuring the 
corresponding physical quantity. In the Schrodinger theory, the simplest such object is the overall Hamiltonian, 
which represents the total energy of the system. If we wish to maintain this interpretation on passing to the Dirac 
theory, we must take the Hamiltonian to be 



H = 7 ° (mc 2 + c j2l k (Pk- ~A k ) c) + qA°. 

\ k=l C J 



This looks promising, because we see by inspection the rest energy of the particle and, in case A = 0> me energy 
of a charge placed in an electric potential qA°- What about the term involving the vector potential? In classical 
electrodynamics, the energy of a charge moving in an applied potential is 
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H = Cyj (p - ^A) 2 + m 2 c 2 + qA°. 

Thus the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart, and we must take great 
care to correctly identify what is an observable in this theory. Much of the apparent paradoxical behavior implied by 
the Dirac equation amounts to a misidentification of these observables. Let us now describe one such effect, (cont'd) 

History 

Since the Dirac equation was originally invented to describe the electron, we will generally speak of "electrons" in 
this article. The equation also applies to quarks, which are also elementary spin-^ particles. A modified Dirac 
equation can be used to approximately describe protons and neutrons, which are not elementary particles (they are 
made up of quarks), but have a net spin of V2. Another modification of the Dirac equation, called the Majorana 
equation, is thought to describe neutrinos — also spin- 1 /! particles. 

The Dirac equation describes the probability amplitudes for a single electron. This is a single-particle theory; in other 
words, it does not account for the creation and destruction of the particles, and for the ultimate need to switch from 
the Dirac equation for wavefunctions to the physically distinct Dirac equation for fields. It gives a good prediction of 
the magnetic moment of the electron and explains much of the fine structure observed in atomic spectral lines. It also 
explains the spin of the electron. Two of the four solutions of the equation correspond to the two spin states of the 
electron. The other two solutions make the peculiar prediction that there exist an infinite set of quantum states in 
which the electron possesses negative energy. This strange result led Dirac to predict, via a remarkable hypothesis 
known as "hole theory," the existence of particles behaving like positively-charged electrons. Dirac thought at first 
these particles might be protons. He was chagrined when the strict prediction of his equation (which actually 
specifies particles of the same mass as the electron) was verified by the discovery of the positron in 1932. When 
asked later why he hadn't actually boldly predicted the yet unfound positron with its correct mass, Dirac answered 
"Pure cowardice!" He shared the Nobel Prize anyway, in 1933. 

A similar equation for spin 3/2 particles is called the Rarita-Sch winger equation. 

Hole theory 

The negative E solutions found in the preceding section are problematic, for it was assumed that the particle has a 
positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negative-energy 
solutions. Since they exist, we cannot simply ignore them, for once we include the interaction between the electron 
and the electromagnetic field, any electron placed in a positive-energy eigenstate would decay into negative-energy 
eigenstates of successively lower energy by emitting excess energy in the form of photons. Real electrons obviously 
do not behave in this way. 

To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the vacuum is the 
many-body quantum state in which all the negative-energy electron eigenstates are occupied. This description of the 
vacuum as a "sea" of electrons is called the Dirac sea. Since the Pauli exclusion principle forbids electrons from 
occupying the same state, any additional electron would be forced to occupy a positive-energy eigenstate, and 
positive-energy electrons would be forbidden from decaying into negative-energy eigenstates. 

Dirac further reasoned that if the negative-energy eigenstates are incompletely filled, each unoccupied eigenstate - 
called a hole - would behave like a positively charged particle. The hole possesses a positive energy, since energy is 
required to create a particle-hole pair from the vacuum. As noted above, Dirac initially thought that the hole might 
be the proton, but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron, 
whereas the proton is over 1800 times heavier. The hole was eventually identified as the positron, experimentally 
discovered by Carl Anderson in 1932. 
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It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negative-energy electrons. The 
infinitely negative contributions from the sea of negative-energy electrons has to be canceled by an infinite positive 
"bare" energy and the contribution to the charge density and current coming from the sea of negative-energy 
electrons is exactly canceled by an infinite positive "jellium" background so that the net electric charge density of the 
vacuum is zero. In quantum field theory, a Bogoliubov transformation on the creation and annihilation operators 
(turning an occupied negative-energy electron state into an unoccupied positive energy positron state and an 
unoccupied negative-energy electron state into an occupied positive energy positron state) allows us to bypass the 
Dirac sea formalism even though, formally, it is equivalent to it. 

In certain applications of condensed matter physics, however, the underlying concepts of "hole theory" are valid. The 
sea of conduction electrons in an electrical conductor, called a Fermi sea, contains electrons with energies up to the 
chemical potential of the system. An unfilled state in the Fermi sea behaves like a positively-charged electron, 
though it is referred to as a "hole" rather than a "positron". The negative charge of the Fermi sea is balanced by the 
positively-charged ionic lattice of the material. 

Dirac bilinears 

There are five different (neutral) Dirac bilinear terms not involving any derivatives: 

• (S)calar: (scalar, P-even) 

• (P)seudoscalar: ^^ip (scalar, P-odd) 

• (V)ector: ip^if) (vector, P-odd) 

• (A)xial: ^^^y 5 ^ (vector, P-even) 

• (T)ensor: ^a^ijj (antisymmetric tensor, P-even), 

where G w = % - [y, y]and 7 5 = 75 = ^c^atVVT* = hVtV- 

A Dirac mass term is an S coupling. A Yukawa coupling may be S or P. The electromagnetic coupling is V. The 
weak interactions are V-A. 

See also 

• Bohr-Sommerfeld theory 

• Breit equation 

• Dirac field 

• Einstein-Maxwell-Dirac equations 

• Feynman checkerboard 

• Foldy-Wouthuysen transformation 

• Klein-Gordon equation 

• Quantum electrodynamics 

• Rarita-Schwinger equation 

• Theoretical and experimental justification for the Schrodinger equation 

• The Dirac Equation appears on the floor of Westminster Abbey. It appears on the plaque commemorating Paul 
Dirac's life which was inaugurated on November 13, 1995 . 
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Klein-Gordon equation 

The Klein-Gordon equation (Klein-Fock-Gordon equation or sometimes Klein-Gordon-Fock equation) is a 
relativistic version of the Schrodinger equation. 

It is the equation of motion of a quantum scalar or pseudoscalar field, a field whose quanta are spinless particles. It 
cannot be straightforwardly interpreted as a Schrodinger equation for a quantum state, because it is second order in 
time and because it does not admit a positive definite conserved probability density. Still, with the appropriate 
interpretation, it does describe the quantum amplitude for finding a point particle in various places, the relativistic 
wavefunction, but the particle propagates both forwards and backwards in time. Any solution to the Dirac equation is 
automatically a solution to the Klein-Gordon equation, but the converse is not true. 



Statement 

The Klein-Gordon equation is 



,2 J2 



?5? *-vV+- jr * = a 

It is most often written in natural units: 

The form is determined by requiring that plane wave solutions of the equation: 

obey the energy momentum relation of special relativity: 

- PflP r = E 2 -P 2 = u; 2 -k 2 = -k^ = m 2 
Unlike the Schrodinger equation, there are two values of uj for each k, one positive and one negative. Only by 
separating out the positive and negative frequency parts does the equation describe a relativistic wavefunction. For 
the time-independent case, the Klein-Gordon equation becomes 



v 2 - 



m 2 c 2 



V>(r) = 0 



which is the homogeneous screened Poisson equation. 



History 

The equation was named after the physicists Oskar Klein and Walter Gordon, who in 1927 proposed that it describes 
relativistic electrons. Although it turned out that the Dirac equation describes the spinning electron, the Klein 
Gordon equation correctly describes the spinless pion. The pion is a composite particle; no spinless elementary 
particles have yet been found, although the Higgs boson is theorized to exist as a spin-zero boson, according to the 
Standard Model. 

The Klein-Gordon equation was first considered as a quantum wave equation by Schrodinger in his search for an 
equation describing de Broglie waves. The equation is found in his notebooks from late 1925, and he appears to have 
prepared a manuscript applying it to the hydrogen atom. Yet, without taking into account the electron's spin, the 
Klein-Gordon equation predicts the hydrogen atom's fine structure incorrectly, including overestimating the overall 
magnitude of the splitting pattern by a factor of 4n/(2n — l)for the ft-th energy level. In January 1926, 
Schrodinger submitted for publication instead his equation, a non-relativistic approximation that predicts the Bohr 
energy levels of hydrogen without fine structure. 

In 1926, soon after the Schrodinger equation was introduced, Vladimir Fock wrote an article about its generalization 
for the case of magnetic fields, where forces were dependent on velocity, and independently derived this equation. 
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Both Klein and Fock used Kaluza and Klein's method. Fock also determined the gauge theory for the wave equation. 
The Klein-Gordon equation for a free particle has a simple plane wave solution. 

Derivation 

The non-relativistic equation for the energy of a free particle is 

p 2 

2m 

By quantizing this, we get the non-relativistic Schrodinger equation for a free particle, 
p 2 d 

where 

p = —ihV 

is the momentum operator ( \7 being the del operator). 

The Schrodinger equation suffers from not being relativistically covariant, meaning it does not take into account 
Einstein's special relativity. 

It is natural to try to use the identity from special relativity 
Vp 2 c 2 + m 2 c 4 = E 

for the energy; then, just inserting the quantum mechanical momentum operator, yields the equation 

d 



ut 

This, however, is a cumbersome expression to work with because the differential operator cannot be evaluated while 
under the square root sign. In addition, this equation, as it stands, is nonlocal. 

Klein and Gordon instead began with the square of the above identity, i.e. 

p 2 c 2 + m 2 c 4 = £ 2 
which, when quantized, gives 

{{-iHVfc 2 + m 2 c 4 )V = (ift^)V 

ot 

which simplifies to 

(dty 

Rearranging terms yields 



i a 2 



$ - v V + '^^^ = o 



2^2 



m c 



c 2 (dt) 2 ^ r h 2 

Since all reference to imaginary numbers has been eliminated from this equation, it can be applied to fields that are 
real valued as well as those that have complex values. 

Using the reciprocal of the Minkowski metric diag(— c 2 , 1, 1, 1) , we get 

777 2 f 2 

-rTd^iP + — = 0 
a 

in covariant notation. This is often abbreviated as 

{□ + ^ = o, 

where 

rnc 
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and 

n 1 a 2 - 
□ = v . 

c 2 dt 2 

This operator is called the d'Alembert operator. Today this form is interpreted as the relativistic field equation for a 
scalar (i.e. spin-0) particle. Furthermore, any solution to the Dirac equation (for a spin-one-half particle) is 
automatically a solution to the Klein-Gordon equation, though not all solutions of the Klein-Gordon equation are 
solutions of the Dirac equation. It is noteworthy that the Klein-Gordon equation is very similar to the Proca 
equation. 

Relativistic free particle solution 

The Klein-Gordon equation for a free particle can be written as 
with the same solution as in the non-relativistic case: 

V>(r, t) = e^ 1 -^ 
except with the constraint 

2 u 2 m 2 c 2 



c 2 h 2 " 

Just as with the non-relativistic particle, we have for energy and momentum: 

< P > = (VI - ihVty) = fik, 
(E) = (i>\ih^\l>) = hw. 

Except that now when we solve for k and oo and substitute into the constraint equation, we recover the relationship 
between energy and momentum for relativistic massive particles: 

(E) 2 = m 2 c 4 + <p) 2 c 2 . 

For massless particles, we may set m = 0 in the above equations. We then recover the relationship between energy 
and momentum for massless particles: 

(E) = (\p\)c 
Action 

The Klein-Gordon equation can also be derived from the following action 

where ij) is the Klein-Gordon field and m is its mass. The complex conjugate of ip is written ^.If the scalar 
field is taken to be real- valued, then ^ = ijj. 

From this we can derive the stress-energy tensor of the scalar field. It is 
i-2 
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Electromagnetic interaction 

There is a simple way to make any field interact with electromagnetism in a gauge invariant way: replace the 
derivative operators with the gauge co variant derivative operators. The Klein Gordon equation becomes: 

=-{d t - ieA 0 ) 2 (f> + {di - %eAif<f> = m 2 <f> 
in natural units, where A is the vector potential. While it is possible to add many higher order terms, for example, 

D^D^tp + AF» v D^D v {D a D a $) = 0 

these terms are not renormalizable in 3+1 dimensions. 

The field equation for a charged scalar field multiplies by i, which means the field must be complex. In order for a 
field to be charged, it must have two components that can rotate into each other, the real and imaginary parts. 

The action for a charged scalar is the covariant version of the uncharged action: 
S = f (drf* + ieA^*)(d v <S> - ieA v <\>)-rf v = f |D0| 2 

J x J X 

Gravitational interaction 

In general relativity, we include the effect of gravity and the Klein-Gordon equation becomes 
-1 m 2 c 2 

or equivalently 

^ 2 Jl 

o = -srvjjrf + 

id 2 r 2 
ft 

where g a P is the reciprocal of the metric tensor that is the gravitational potential field, 9 is the determinant of the 
metric tensor, is the covariant derivative and T a ^ is the Christoffel symbol that is the gravitational force field. 

References 

• Sakurai, J. J. (1967). Advanced Quantum Mechanics. Addison Wesley. ISBN 0-201-06710-2. 

• Davydov, A.S. (1976). Quantum Mechanics, 2nd Edition. Pergamon. ISBN 0-08-020437-6. 



External links 

• Weisstein, Eric W., "Klein-Gordon equation from Math World. 
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Einstein-Maxwell-Dirac equations 

Einstein-Maxwell-Dirac equations (EMD) are related to quantum field theory. The current Big Bang Model is a 
quantum field theory in a curved spacetime. Unfortunately, no such theory is mathematically well-defined; in spite 
of this, theoreticians claim to extract information from this hypothetical theory. On the other hand, the 
super-classical limit of the not mathematically well-defined QED in a curved spacetime is the mathematically 
well-defined Einstein-Maxwell-Dirac system. (One could get a similar system for the standard model.) As a super 
theory, EMD violates the positivity condition in the Penrose-Hawking Singularity Theorem. Thus, it is possible that 
there would be complete solutions without any singularities- Yau has in fact constructed some. Furthermore, it is 
known that the Einstein-Maxwell-Dirac system admits of solitonic solutions, i.e., classical electrons and photons. 
This is the kind of theory Einstein was hoping for. EMD is also a totally geometricized theory as a non-commutative 
geometry; here, the charge e and the mass m of the electron are geometric invariants of the non-commutative 
geometry analogous to pi. 

One way of trying to construct a rigorous QED and beyond is to attempt to apply the deformation quantization 
program to MD, and more generally, EMD. This would involve the following. 

Program for SCESM 

The Super-Classical Einstein-Standard Model: 

• 1. Extend Asymptotic Completeness, Global Existence and the Infrared Problem for the Maxwell-Dirac Equations 
to SCESM (Memoirs of the American Mathematical Society), by M. Flato, Jacques C. H. Simon, Erik Taflin 
(http://www.amazon.com/Asymptotic-Completeness-Existence-Maxwell-Dirac-Mathematical/dp/ 
0821806831/ref=sr_l_l?ie=UTF8&s=books&qid=1240926988&sr=l-l). 

• 2. Show that the positivity condition in the Penrose-Hawking singularity theorem is violated for the SCESM. 
Construct smooth solutions to SCESM having Dark Stars. See here: The Large Scale Structure of Space-Time by 
Stephen W. Hawking, G. F. R. Ellis (http://www.amazon.com/ 

Structure-Space-Time-Cambridge-Monographs-Mathematical/dp/0521099064/ref=pd_bbs_sr_lie=UTF8& 
s=books&qid=1240927769&sr=8-l) 

• 3. Follow three substeps 

• i. Derive approximate history of the universe from SCESM - both analytically and via computer simulation. 

• ii. Compare with ESM (the QSM in a curved space-time). 

• iii. Compare with observation. See: Cosmology by Steven Weinberg (http://www.amazon.com/ 
Cosmology-StevenWeinberg/dp/O^ 

• 4. Show that the solution space to SCESM, F, is a reasonable infinite dimensional super- sympletic manifold. See: 
Supersymmetry for Mathematicians: An Introduction (http://www.amazon.com/ 

Supersymmetry-Mathematicians-Introduction-Courant-Lecture/dp/082 1 835742/ref=sr_l_5 ?ie=UTF8& 
s=books&qid= 1 2408940 1 1 &sr= 1 -5 ) 

• 5. Apply deformation quantization to F to obtain mathematically rigorous definition of SQESM (quantum version 
of SCESM). See: 

Deformation Theory and Symplectic Geometry by Daniel Sternheimer, John Rawnsley (http://www.amazon.com/ 
Deformation-Symplectic-Geometry-Mathematical-Physics/ dp/ 0792345258/ ref=sr_l_l?ie=UTF8& s=books& 
qid=1240930131&sr=l-l) 

• 6. Derive history of the universe from SQESM and compare with observation. 
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Rigged Hilbert space 

In mathematics, a rigged Hilbert space (Gelfand triple, nested Hilbert space, equipped Hilbert space) is a 

construction designed to link the distribution and square-integrable aspects of functional analysis. Such spaces were 
introduced to study spectral theory in the broad sense. They can bring together the 'bound state' (eigenvector) and 
'continuous spectrum', in one place. 



Motivation 

Since a function such as 
x h-> e ix > 

which is in an obvious sense an eigenvector of the differential operator 

. d 
dx 

on the real line R, is not square-integrable for the usual Borel measure on R, this requires some way of stepping 
outside the strict confines of the Hilbert space theory. This was supplied by the apparatus of Schwartz distributions, 
and a generalized eigenfunction theory was developed in the years after 1950. 



Functional analysis approach 

The concept of rigged Hilbert space places this idea in abstract functional- analytic framework. Formally, a rigged 
Hilbert space consists of a Hilbert space H, together with a subspace O which carries a finer topology, that is one for 
which the natural inclusion 

$ C H 

is continuous. It is no loss to assume that O is dense in H for the Hilbert norm. We consider the inclusion of dual 
spaces H in O . The latter, dual to O in its 'test function' topology, is realised as a space of distributions or 
generalised functions of some sort, and the linear functionals on the subspace O of type 

<t> ^ <V 3 0) 

for v in H are faithfully represented as distributions (because we assume O dense). 
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Now by applying the Riesz representation theorem we can identify H with H. Therefore the definition of rigged 
Hilbert space is in terms of a sandwich: 

$ C H C ©*. 

The most significant examples are for which O is a nuclear space; this comment is an abstract expression of the idea 
that O consists of test functions and O* of the corresponding distributions. 

Formal definition (Gelfand triple) 

A rigged Hilbert space is a pair (H,0) with H a Hilbert space, O a dense subspace, such that O is given a 
topological vector space structure for which the inclusion map i is continuous. Identifying H with its dual space H* 9 
the adjoint to i is the map i* \ H = H* — > $* • The duality pairing between O and O has to be compatible with 
the inner product on H. (u,v)<f> x $>* = (it, v) H whenever u G <$> C if and y £ H = H* C <£* • 
The specific triple i/ ; $*} is often named the "Gelfand triple" (after the mathematician Israel Gelfand). 

Note that even though O is isomorphic to O if O is a Hilbert space in its own right, this isomorphism is not the same 
as the composition of the inclusion / with its adjoint /* 
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Quantum inverse scattering method 



Quantum inverse scattering method relates two different approaches: l)Inverse scattering transform is a method of 
solving classical integrable differential equations of evolutionary type. Important concept is Lax representation. 2) 
Bethe ansatz is a method of solving quantum models in one space and one time dimension. Quantum inverse 
scattering method starts by quantization of Lax representation and reproduce results of Bethe ansatz. Actually it 
permits to rewrite Bethe ansatz in a new form: algebraic Bethe ansatz. This led to further progress in understanding 
of Heisenberg model (quantum), quantum Nonlinear Schrodinger equation and Hubbard model. 

In mathematics, the quantum inverse scattering method is a method for solving integrable models in 1+1 
dimensions introduced by L. D. Faddeev in about 1979. 
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A quasi-Hopf algebra is a generalization of a Hopf algebra, which was defined by the Russian mathematician 
Vladimir Drinfeld in 1989. 



Quasi-Hopf algebra 



A quasi-Hopf algebra is a quasi-bialgebra Bj± = (w4. 
antihomomorphism S (antipode) of J[ such that 



., A, £, <I>) for which there exist a, (3 £ A. and a bijective 




for all a £ A. an d where 



and 



Y / S(P j )aQ j /3S(R j )=I. 



3 



where the expansions for the quantities $and c£ -1 are given by 



and 
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* _1 = E^®Qi®^ 

3 

As for a quasi-bialgebra, the property of being quasi-Hopf is preserved under twisting. 

Usage 

Quasi-Hopf algebras form the basis of the study of Drinfeld twists and the representations in terms of F-matrices 
associated with finite-dimensional irreducible representations of quantum affine algebra. F-matrices can be used to 
factorize the corresponding R-matrix. This leads to applications in Statistical mechanics, as quantum affine algebras, 
and their representations give rise to solutions of the Yang-Baxter equation, a solvability condition for various 
statistical models, allowing characteristics of the model to be deduced from its corresponding quantum affine 
algebra. The study of F-matrices has been applied to models such as the Heisenberg XXZ model in the framework of 
the algebraic Bethe ansatz. It provides a framework for solving two-dimensional integrable models by using the 
Quantum inverse scattering method. 
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Vol. 201, 2000 

Quasitriangular Hopf algebra 

In mathematics, a Hopf algebra, H, is quasitriangular^ if there exists an invertible element, R, of JJ ® H such 
that 

• R A(x) = (To R for all x £ H » where A is the coproduct on H, and the linear map 
T \H®H -> H®Hi& given by T(x ® y) = y <g> x , 

• (A ® l)(i?) = R 13 R 23 , 

• (1® A){R) = R 13 R 12 , 

where R l2 = <p 12 (R) , R 13 = <t>i 3 {R) , and R 23 = <j) 23 (R) , where 0 12 : H <g> H -> H <g> H ® H , 
013 :H®H^H(&H®H, and 023 : H ® H — > H ® H <E> H , are algebra morphisms determined 
by 

(fruia ® 6) = a ® b ® 1, 
0i 3 (a ® b) = a <g> 1 ® 6, 

023(a ® ft) = 1 ® a <8) 
is called the R-matrix. 

As a consequence of the properties of quasitriangularity, the R-matrix, R, is a solution of the Yang-Baxter equation 
(and so a module V of H can be used to determine quasi-invariants of braids, knots and links). Also as a consequence 
of the properties of quasitriangularity, (e ® = (1 <g> = 1 G ; moreover J? -1 = (£ ® 1)(J2), 
fl = (1 ® S^R- 1 ), and (5 ® = R . One may further show that the antipode S must be a linear 

isomorphism, and thus S A 2 is an automorphism. In fact, S A 2 is given by conjugating by an invertible element: 
S(x) = uxu -1 where u = m(S f <g> l)i? 21 (cf. Ribbon Hopf algebras). 

It is possible to construct a quasitriangular Hopf algebra from a Hopf algebra and its dual, using the Drinfel'd 
quantum double construction. 
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Twisting 

The property of being a quasi-triangular Hopf algebra is preserved by twisting via an invertible element 
F = ^1 / ® fi ^ A® A mat ^ e (g) id)p — ^oJ (g) e) F = land satisfying the cocycle condition 

i 

(F ® 1) o (A ® = (1 ® F) o (id <g> A)F 

Furthermore, " = is invertible and the twisted antipode is given by S'^a) = uS{a)u 1 , with the 

i 

twisted comultiplication, R-matrix and co-unit change according to those defined for the quasi-triangular Quasi-Hopf 
algebra. Such a twist is known as an admissible (or Drinfel'd) twist. 

Notes 

[1] Montgomery & Schneider (2002), p. 72 (http://books.google.com/books?id=I3IK9U5Co_0C&pg=PA72&dq=''Quasitriangular''). 
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Ribbon Hopf algebra 

A ribbon Hopf algebra (^4 ? m, A, it, £, S : 7Z : i/)is a quasitriangular Hopf algebra which possess an invertible 
central element v more commonly known as the ribbon element, such that the following conditions hold: 
v 2 = uS{u), S(u) = v, e{v) = 1 

a(i/) = {n 21 n 12 )-\v®v) 

where u = m(S ® id) (7^21 )• Note that the element u exists for any quasitriangular Hopf algebra, and uS(u) 

must always be central and satisfies 

S(uS(u)) = uS(u),e(uS(u)) = l,A(uS(u)) = (n 2l n l2 )~ 2 (uS(u) ® uS(u))> so that all that is 

required is that it have a central square root with the above properties. 
Here 

A is a vector space 

m is the multiplication map 777, ; A ® A — > A 
A is the co-product map A : A — > A ® A 
u is the unit operator u : C — > A 
£ is the co-unit operator £ ; ^4 — > C 
5 is the antipode S : A ^ A 
is a universal R matrix 
We assume that the underlying field K is C 
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See also 

• Quasitriangular Hopf algebra 

• Quasi-triangular Quasi-Hopf algebra 

References 

• Altschuler, D., Coste, A.: Quasi-quantum groups, knots, three-manifolds and topological field theory. Commun. 
Math. Phys. 150 1992 83-107 http://arxiv.org/pdf/hep-th/9202047 

• Chari, V.C., Pressley, A.: A Guide to Quantum Groups Cambridge University Press, 1994 ISBN 0-521-55884-0. 

• Vladimir Drinfeld, Quasi-Hopf algebras, Leningrad Math J. 1 (1989), 1419-1457 

• Shahn Majid : Foundations of Quantum Group Theory Cambridge University Press, 1995 

Quasi-triangular Quasi-Hopf algebra 

A quasi-triangular quasi-Hopf algebra is a specialized form of a quasi-Hopf algebra defined by the Ukrainian 
mathematician Vladimir Drinfeld in 1989. It is also a generalized form of a quasi-triangular Hopf algebra. 

A quasi-triangular quasi-Hopf algebra is a set 7Y^4 = (*4. 5 i?, A, £, where Bj± = (*4 5 A, £, <1>) is a 

quasi-Hopf algebra and R £ j\ g) known as the R-matrix, is an invertible element such that 

RA(a) = a o A(a)R, a <E A 

a\A®A^A®A 

x ® y — > y <g> x 
so that a is the switch map and 

(A g> id)R = $32ltfl3*r32^23$123 

(id ® A)R = $^3 1 1 i?i3$213^12*r2 1 3 
where $ abc = x a <g> x b <g> x c and $ 123 = Q = Xi ® x 2 ® x$ £ A® A® A . 

The quasi-Hopf algebra becomes triangular if in addition, i? 2 i R\2 — 1 • 

The twisting of l~Lj^ by F £ ^4 ® A i s me same as for a quasi-Hopf algebra, with the additional definition of the 
twisted ^-matrix 

A quasi-triangular (resp. triangular) quasi-Hopf algebra with $ = lis a quasi-triangular (resp. triangular) Hopf 
algebra as the latter two conditions in the definition reduce the conditions of quasi-triangularity of a Hopf algebra . 

Similarly to the twisting properties of the quasi-Hopf algebra, the property of being quasi-triangular or triangular 
quasi-Hopf algebra is preserved by twisting. 

See also 

• Quasitriangular Hopf algebra 

• Ribbon Hopf algebra 
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• J.M. Maillet and J. Sanchez de Santos, Drinfeld Twists and Algebraic Bethe Ansatz, Amer. Math. Soc. Transl. (2) 
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Grassmann algebra 

In mathematics, the exterior product or wedge product of vectors is an algebraic construction generalizing certain 
features of the cross product to higher dimensions. Like the cross product, and the scalar triple product, the exterior 
product of vectors is used in Euclidean geometry to study areas, volumes, and their higher-dimensional analogs. 
Also, like the cross product, the exterior product is alternating, meaning that u a u = 0 for all vectors u, or 
equivalently^ u a v = -v a u for all vectors u and v. In linear algebra, the exterior product provides an abstract 
algebraic manner for describing the determinant and the minors of a linear transformation that is basis-independent, 
and is fundamentally related to ideas of rank and linear independence. 

The exterior algebra (also known as the Grassmann algebra, after Hermann Grassmann ) of a given vector 
space V over a field K is the unital associative algebra A(V) generated by the exterior product. It is widely used in 
contemporary geometry, especially differential geometry and algebraic geometry through the algebra of differential 
forms, as well as in multilinear algebra and related fields. In terms of category theory, the exterior algebra is a type 
of functor on vector spaces, given by a universal construction. The universal construction allows the exterior algebra 
to be defined, not just for vector spaces over a field, but also for modules over a commutative ring, and for other 
structures of interest. The exterior algebra is one example of a bialgebra, meaning that its dual space also possesses a 
product, and this dual product is compatible with the wedge product. This dual algebra is precisely the algebra of 
alternating multilinear forms on V, and the pairing between the exterior algebra and its dual is given by the interior 
product. 



Motivating examples 
Areas in the plane 

2 

The Cartesian plane R is a vector space equipped with a basis 
consisting of a pair of unit vectors 



(a+c,b+d) 




The area of a parallelogram in terms of the 
determinant of the matrix of coordinates of two of 
its vertices. 



ex = (1,0), e 2 = (0,l). 
Suppose that 

v = viei + V2&2>> w = wi&i + w 2 e 2 

2 

are a pair of given vectors in R , written in components. There is a unique parallelogram having v and w as two of its 
sides. The area of this parallelogram is given by the standard determinant formula: 
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A = |det [v w] | = \v1W2 — v 2 w 1 \. 
Consider now the exterior product of v and w: 

v A w = (uiei + v 2 e 2 ) A (u>iei + w 2 e 2 ) 

= viwiei A ei + viw 2 ei A e 2 + v 2 wie 2 A ei + v 2 w 2 e 2 A e 2 

= (v 1 w 2 — ^2^1)^1 A e 2 

where the first step uses the distributive law for the wedge product, and the last uses the fact that the wedge product 
is alternating, and in particular e 2 a e 1 = -e 1 a e^. Note that the coefficient in this last expression is precisely the 
determinant of the matrix [v w]. The fact that this may be positive or negative has the intuitive meaning that v and w 
may be oriented in a counterclockwise or clockwise sense as the vertices of the parallelogram they define. Such an 
area is called the signed area of the parallelogram: the absolute value of the signed area is the ordinary area, and the 
sign determines its orientation. 

The fact that this coefficient is the signed area is not an accident. In fact, it is relatively easy to see that the exterior 
product should be related to the signed area if one tries to axiomatize this area as an algebraic construct. In detail, if 
A(v,w) denotes the signed area of the parallelogram determined by the pair of vectors v and w, then A must satisfy 
the following properties: 

1. A(a\,bw) = ab A(v,w) for any real numbers a and b, since rescaling either of the sides rescales the area by the 
same amount (and reversing the direction of one of the sides reverses the orientation of the parallelogram). 

2. A(v,v) = 0, since the area of the degenerate parallelogram determined by v (i.e., a line segment) is zero. 

3. A(w,v) = -A(v,w), since interchanging the roles of v and w reverses the orientation of the parallelogram. 

4. A(v + <zw,w) = A(v,w), since adding a multiple of w to v affects neither the base nor the height of the 
parallelogram and consequently preserves its area. 

5. A(e , e ) = 1, since the area of the unit square is one. 

With the exception of the last property, the wedge product satisfies the same formal properties as the area. In a 

certain sense, the wedge product generalizes the final property by allowing the area of a parallelogram to be 

compared to that of any "standard" chosen parallelogram. In other words, the exterior product in two-dimensions is a 

T31 

basis-independent formulation of area. 

Cross and triple products 

For vectors in R , the exterior algebra is closely related to the cross product and triple product. Using the standard 
basis {e 1? e 2 , e 3 }, the wedge product of a pair of vectors 

u = itiei + u 2 e 2 + u 3 e 3 

and 

v = vi&i + v 2 e 2 + v 3 e 3 

is 

u A v = {uiv 2 - u 2 v 1 )(e 1 A e 2 ) + (u 3 v 1 - uiv 3 ) (e 3 A + (u 2 v 3 - u 3 v 2 )(e 2 A e 3 ) 

2 3 

where {e 1 A e 2 , e 3 A e 2 A e 3 } is the basis for the three-dimensional space A (R ). This imitates the usual 
definition of the cross product of vectors in three dimensions. 

Bringing in a third vector 

w = W1B1 + w 2 e 2 + w 3 e 3 , 

the wedge product of three vectors is 

uAvAw = (uiv 2 w 3 +u 2 v 3 wi+u 3 viw 2 — uiv 3 w 2 — u 2 viw 3 —u 3 v 2 wi)(eiAe 2 Ae 3 ) 

3 3 

where e 1 A e 2 A e^ is the basis vector for the one-dimensional space A (R ). This imitates the usual definition of the 
triple product. 
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The cross product and triple product in three dimensions each admit both geometric and algebraic interpretations. 
The cross product uxv can be interpreted as a vector which is perpendicular to both u and v and whose magnitude is 
equal to the area of the parallelogram determined by the two vectors. It can also be interpreted as the vector 
consisting of the minors of the matrix with columns u and v. The triple product of u, v, and w is geometrically a 
(signed) volume. Algebraically, it is the determinant of the matrix with columns u, v, and w. The exterior product in 
three-dimensions allows for similar interpretations. In fact, in the presence of a positively oriented orthonormal 
basis, the exterior product generalizes these notions to higher dimensions. 

Formal definitions and algebraic properties 

The exterior algebra A(V) over a vector space V is defined as the quotient algebra of the tensor algebra by the 
two-sided ideal / generated by all elements of the form x <E> x such that x G V.^ Symbolically, 

A(V) := T(V)/I. 

The wedge product a of two elements of A(V) is defined by 

a A f3 = a® {3 (mod/). 
Anticommutativity of the wedge product 

The wedge product is alternating on elements of V, which means that x a x = 0 for all x G V. It follows that the 
product is also anticommutative on elements of V, for supposing that xjEV, 

0 = (x + y)A(x + y) = xAx + xAy + yAx + yAy = xAy + yAx 
whence 

x Ay = —y A x. 

More generally, ifx^x^ x^ are elements of V, and a is a permutation of the integers [1, ...,£], then 

2^(1) A x a( 2) A • • • A x a{k) = sgn(cr)x 1 A x 2 A • • • A x k , 

where sgn(a) is the signature of the permutation oP^ 

The exterior power 

k 

The Mi exterior power of V, denoted A (V), is the vector subspace of A(V) spanned by elements of the form 
A X2 A . . . A Xfc, Xi € V, i = 1, 2, . . . , fc. 

k 

If a G A (V), then a is said to be a &-multi vector. If, furthermore, a can be expressed as a wedge product of k 

k 

elements of V, then a is said to be decomposable. Although decomposable multivectors span A (V), not every 

k 4 
element of A (V) is decomposable. For example, in R , the following 2-multivector is not decomposable: 

a. = e\ A e 2 + e 3 A e 4 . 

(This is in fact a symplectic form, since a a a ^ 0 [6] ) 
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Basis and dimension 

If the dimension of Vis n and {e ,...,e } is a basis of V, then the set 
{e^ A e i2 A • • • A e ik | 1 < i\ < i 2 < - - ■ < t& < n} 

k 

is a basis for A (V). The reason is the following: given any wedge product of the form 
Vi A - - - A Vk 

then every vector v . can be written as a linear combination of the basis vectors e .; using the bilinearity of the wedge 

J 1 
product, this can be expanded to a linear combination of wedge products of those basis vectors. Any wedge product 

in which the same basis vector appears more than once is zero; any wedge product in which the basis vectors do not 

appear in the proper order can be reordered, changing the sign whenever two basis vectors change places. In general, 

the resulting coefficients of the basis vectors can be computed as the minors of the matrix that describes the vectors 

v. in terms of the basis e.. 

J i 

By counting the basis elements, the dimension of A (V) is the binomial coefficient C(n,k). In particular, A (V) = {0} 
for k > n. 

Any element of the exterior algebra can be written as a sum of multi vectors. Hence, as a vector space the exterior 
algebra is a direct sum 

A(V) = A°{V) © A}(V) © A 2 {V) © • • • © A n (V) 

(where by convention A°(V) = K and A l (V) = V), and therefore its dimension is equal to the sum of the binomial 
coefficients, which is 2 n . 

Rank of a multivector 

If a G A k (V) 9 then it is possible to express a as a linear combination of decomposable multivectors: 

a = a m + a P) + ... + a (s) 

where each is decomposable, say 

a W =af ) A-Aaf, 1 = 1,2,...,*. 

The rank of the multivector a is the minimal number of decomposable multivectors in such an expansion of a. This 
is similar to the notion of tensor rank. 

Rank is particularly important in the study of 2-multivectors (Sternberg 1974, §111.6) (Bryant et al. 1991). The rank 

of a 2-multi vector a can be identified with half the rank of the matrix of coefficients of a in a basis. Thus if e. is a 

i 

basis for V, then a can be expressed uniquely as 
a = ^ a ij e i A e j 

where a.. - -a., (the matrix of coefficients is skew- symmetric). The rank of the matrix a., is therefore even, and is 

ij Ji y 
twice the rank of the form a. 

In characteristic 0, the 2-multivector a has rank p if and only if 

a A — ■ A a ^ 0 
v 

and 

a A — ■ A a = 0. 
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Graded structure 

The wedge product of a &-multi vector with a /?-multi vector is a (&+/?)-multi vector, once again invoking bilinearity. 
As a consequence, the direct sum decomposition of the preceding section 

A(V) = A°(V) © A^V) © A 2 (V) © • • • © A n (V) 

gives the exterior algebra the additional structure of a graded algebra. Symbolically, 

(A k (V)) A {A P {V)) C A k+p (V). 

Moreover, the wedge product is graded anticommutative, meaning that if a G A k (V) and (3 G A P (V), then 

qA/? = (-1)^Aq. 

In addition to studying the graded structure on the exterior algebra, Bourbaki (1989) studies additional graded 
structures on exterior algebras, such as those on the exterior algebra of a graded module (a module that already 
carries its own gradation). 

Universal property 

Let V be a vector space over the field K. Informally, multiplication in A(V) is performed by manipulating symbols 
and imposing a distributive law, an associative law, and using the identity v a v = 0 for v E V. Formally, A(V) is the 
"most general" algebra in which these rules hold for the multiplication, in the sense that any unital associative 
^-algebra containing V with alternating multiplication on V must contain a homomorphic image of A(V). In other 
words, the exterior algebra has the following universal property: 

Given any unital associative 7^-algebra A and any TT-linear map j : V —> A such that j(v)j(v) = 0 for every v in V, then 
there exists precisely one unital algebra homomorphism/ : A(V) —> A such that/(v) = j(i(v)) for all v in V. 

V — A(V) 

: / 
+ 

To construct the most general algebra that contains V and whose multiplication is alternating on V, it is natural to 
start with the most general algebra that contains V, the tensor algebra T(V), and then enforce the alternating property 
by taking a suitable quotient. We thus take the two-sided ideal / in T(V) generated by all elements of the form v®v 
for v in V, and define A(V) as the quotient 

A(V)=T(V)/I 

(and use A as the symbol for multiplication in A(V)). It is then straightforward to show that A(V) contains V and 
satisfies the above universal property. 

As a consequence of this construction, the operation of assigning to a vector space V its exterior algebra A(V) is a 
functor from the category of vector spaces to the category of algebras. 

Rather than defining A(V) first and then identifying the exterior powers A k (V) as certain subspaces, one may 
alternatively define the spaces A k (V) first and then combine them to form the algebra A(V). This approach is often 
used in differential geometry and is described in the next section. 
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Generalizations 

Given a commutative ring R and an /^-module M, we can define the exterior algebra A(M) just as above, as a suitable 
quotient of the tensor algebra T(M). It will satisfy the analogous universal property. Many of the properties of A(M) 
also require that M be a projective module. Where finite-dimensionality is used, the properties further require that M 
be finitely generated and projective. Generalizations to the most common situations can be found in (Bourbaki 
1989). 

Exterior algebras of vector bundles are frequently considered in geometry and topology. There are no essential 
differences between the algebraic properties of the exterior algebra of finite-dimensional vector bundles and those of 
the exterior algebra of finitely-generated projective modules, by the Serre-Swan theorem. More general exterior 
algebras can be defined for sheaves of modules. 

Duality 

Alternating operators 

Given two vector spaces V and X, an alternating operator (or anti-symmetric operator) from to X is a multilinear 
map 

/ : V k -> X 

such that whenever v 1? ...,v^ are linearly dependent vectors in V, then 

f(v u ...,v k ) = 0 

A well-known example is the determinant, an alternating operator from (f^) n to K. 
The map 

w : V k -> A k (V) 

which associates to k vectors from V their wedge product, i.e. their corresponding ^-vector, is also alternating. In 
fact, this map is the "most general" alternating operator defined on V k : given any other alternating operator/: —> 
X, there exists a unique linear map cp: A k (V) —> X with/= cp o w. This universal property characterizes the space 

k 

A (V) and can serve as its definition. 
Alternating multilinear forms 

The above discussion specializes to the case when X = K, the base field. In this case an alternating multilinear 
function 

/ : V k -> K 

is called an alternating multilinear form. The set of all alternating multilinear forms is a vector space, as the sum 
of two such maps, or the product of such a map with a scalar, is again alternating. By the universal property of the 
exterior power, the space of alternating forms of degree k on V is naturally isomorphic with the dual vector space 
(A k V)*. If V is finite-dimensional, then the latter is naturally isomorphic to A k (V*). In particular, the dimension of 
the space of anti- symmetric maps from to K is the binomial coefficient n choose k. 

Under this identification, the wedge product takes a concrete form: it produces a new anti-symmetric map from two 
given ones. Suppose oo : V* -> K and x\ : V 71 —> K are two anti- symmetric maps. As in the case of tensor products of 
multilinear maps, the number of variables of their wedge product is the sum of the numbers of their variables. It is 
defined as follows: 

(k + m)\ 41 , 
uj A r) = ^— — -^-Alt(u; <g> rj) 
k\ ml 

where the alternation Alt of a multilinear map is defined to be the signed average of the values over all the 
permutations of its variables: 
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Alt(o;)(xi 5 . . . , x k ) = — ^ *g*(°) "fcr(i), ■ ■ ■ , 

This definition of the wedge product is well-defined even if the field K has finite characteristic, if one considers an 
equivalent version of the above that does not use factorials or any constants: 

u)Ar)(x u . . .,x fc+m ) = ^2 BP^Jw^i), . . . ,a; ff (jfc))i/(i ff ( H i), . . . , x 0 .( jfc+m )) ) 

<reSh kjTn 

where here 57* C 5 is the subset of (k,m) shuffles: permutations a of the set {1,2,..., k+m] such that o(l) < a(2) 

K,tTl Kitn ron 

< . . . < a(Jfc), and a(ife+l) < a(&+2)< . . . <a(Jfc+m). L J 
Bialgebra structure 

In formal terms, there is a correspondence between the graded dual of the graded algebra A(V) and alternating 
multilinear forms on V. The wedge product of multilinear forms defined above is dual to a coproduct defined on 
A(V), giving the structure of a coalgebra. 

The coproduct is a linear function A : A(V) — » A(V) ® A(V) given on decomposable elements by 

k 

A(x 1 A - ■ -Aijt) = ^ S s enW(^(i) A- ■ ■Ai ff ( p ))®(i ff ( p+ i) A - • -Ax^). 

For example, 

A(a?i) = 1 ® X! + Xi ® 1, 

A(xi A X2) = 1 ® (xi A X2) + xi (8) ^2 — X2 ® xi + (xi A X2) ® 1. 
This extends by linearity to an operation defined on the whole exterior algebra. In terms of the coproduct, the wedge 
product on the dual space is just the graded dual of the coproduct: 

(a A /?)(x! A . . . A x k ) = (a ® j3) {A(x 1 A ... A x fc )) 
where the tensor product on the right-hand side is of multilinear linear maps (extended by zero on elements of 
incompatible homogeneous degree: more precisely, cxa|3 = 8 o (a®|3) o A, where 8 is the counit, as defined 
presently). 

The counit is the homomorphism 8 : A(V) —> K which returns the 0-graded component of its argument. The 
coproduct and counit, along with the wedge product, define the structure of a bialgebra on the exterior algebra. 

With an antipode defined on homogeneous elements by S(x) = (-l) deg x x, the exterior algebra is furthermore a Hopf 
algebra. 1 

Interior product 

Suppose that V is finite-dimensional. If V* denotes the dual space to the vector space V, then for each a E V , it is 
possible to define an antiderivation on the algebra A(V), 

i a : A k V -> A k ~ l V. 

This derivation is called the interior product with a, or sometimes the insertion operator, or contraction by a. 

Suppose that w G A k V. Then w is a multilinear mapping of V* to K, so it is defined by its values on the &-fold 
Cartesian product V x V x ... x V . If u^, u^, are k-1 elements of V , then define 

{i a w)(u u u 2 ... , Ufe-i) = w(a, u u u 2 , . - - 5 Ufc-i). 

Additionally, let ij= 0 whenever /is a pure scalar (i.e., belonging to A°V). 
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Axiomatic characterization and properties 

The interior product satisfies the following properties: 

1 . For each k and each a E V , 

i a : A h V -> A^V. 

(By convention, A -1 = 0.) 

2. If v is an element of V ( = A 1 V), then i v = a(v) is the dual pairing between elements of V and elements of V*. 

3. For each a £ V , is a graded derivation of degree -1: 

i a (a A b) = (i a a) Ab+ (-l) dega a A (i a b). 
In fact, these three properties are sufficient to characterize the interior product as well as define it in the general 
infinite-dimensional case. 

Further properties of the interior product include: 

• i a o i a = 0. 

• hy. ° ip — —ip 0 in- 

Hodge duality 

Suppose that V has finite dimension n. Then the interior product induces a canonical isomorphism of vector spaces 

A k (V*) ® A n (V) -> A n - k (V). 

In the geometrical setting, a non-zero element of the top exterior power A n (V) (which is a one-dimensional vector 
space) is sometimes called a volume form (or orientation form, although this term may sometimes lead to 
ambiguity.) Relative to a given volume form a, the isomorphism is given explicitly by 

If, in addition to a volume form, the vector space V is equipped with an inner product identifying V with V , then the 
resulting isomorphism is called the Hodge dual (or more commonly the Hodge star operator) 

* : A k (V) -> A n - k (V). 

k k 

The composite of * with itself maps A (V) — » A (V) and is always a scalar multiple of the identity map. In most 
applications, the volume form is compatible with the inner product in the sense that it is a wedge product of an 
orthonormal basis of V. In this case, 

* o * : A k (V) -> A k (V) = 

where / is the identity, and the inner product has metric signature (p,q) — p plusses and q minuses. 

Inner product 

For V a finite-dimensional space, an inner product on V defines an isomorphism of V with V*, and so also an 
isomorphism of A^V with (A k V)*. The pairing between these two spaces also takes the form of an inner product. On 
decomposable &-multi vectors, 

(vi A - - - A VfaWi A • • • A w k ) = det((^ 5 u^-)), 
the determinant of the matrix of inner products. In the special case v. = w., the inner product is the square norm of the 
multivector, given by the determinant of the Gramian matrix (bv„ v.D). This is then extended bilinearly (or 

1 J k 

sesquilinearly in the complex case) to a non-degenerate inner product on A V. If e , /=l,2,...,/2, form an orthonormal 
basis of V, then the vectors of the form 

e h A - Ae ik: i x < • • • < i k , 

k 

constitute an orthonormal basis for A (V). 
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With respect to the inner product, exterior multiplication and the interior product are mutually adjoint. Specifically, 
for v G A k ~ l (V), w G A*(V), and iEV, 

(x Av,w) = (v, vw> 

where x G V is the linear functional defined by 

x\y) = (x,y) 

for all y G V. This property completely characterizes the inner product on the exterior algebra. 

Functoriality 

Suppose that V and W are a pair of vector spaces and / : V —> W is a linear transformation. Then, by the universal 
construction, there exists a unique homomorphism of graded algebras 

A(/) : A(V) -> A(W) 

such that 

A(/)| A i ( v) =f:V = A\V) ^W = A\W). 

In particular, A(f) preserves homogeneous degree. The ^-graded components of A(f) are given on decomposable 
elements by 

A(/)(x! A ... A Xk) = fix,) A ... A f(x k ). 

Let 

A fc (/) = A(f) AHv) : A k (V) -» A*(W). 

The components of the transformation A(&) relative to a basis of V and W is the matrix of k x k minors of /. In 
particular, if V = W and V is of finite dimension n, then A n (f) is a mapping of a one-dimensional vector space A n to 
itself, and is therefore given by a scalar: the determinant off. 

Exactness 

If 

is a short exact sequence of vector spaces, then 

0 -> A 1 (17) A A(f) -> A(V) -> A(W) -> 0 

is an exact sequence of graded vector spaces tl0] as is 

o->A(to ^A(y). [11] 

Direct sums 

In particular, the exterior algebra of a direct sum is isomorphic to the tensor product of the exterior algebras: 
A(V © W) = A(V) ® A(W). 

This is a graded isomorphism; i.e., 

A fc (y©W)= 0 A p (V) © A 9 (W0. 

Slightly more generally, if 

is a short exact sequence of vector spaces then A k ( V) has a filtration 
0 = F° C F 1 C ■ ■ ■ C F k C = A fc (V) 

with quotients : F"P+ l / F p = A fc_p (/7) <g> A P (W) • In particular, if U is 1-dimensional then 
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0 -> U <g> A*" 1 ^) -> A*(V) -> A fc (W) -> 0 

is exact, and if Wis 1 -dimensional then 

0 -> A*(tf) -> A*(V) -> A fe_1 (i7) 

is exact J 1 2 ^ 



The alternating tensor algebra 

ri3i 

If K is a field of characteristic 0, L then the exterior algebra of a vector space V can be canonically identified with 
the vector subspace of T(V) consisting of antisymmetric tensors. Recall that the exterior algebra is the quotient of 
T( V) by the ideal / generated by x ® x. 

Let T r ( V) be the space of homogeneous tensors of degree r. This is spanned by decomposable tensors 

Vi ® . . . (g> IV, u» G V. 
The antisymmetrization (or sometimes the skew-symmetrization) of a decomposable tensor is defined by 



1 X 

Alt^! (g) - ■ ■ (g) ty) — — sgn(cr)i; CT (i) (g> • • • <g) v^ T ) 



where the sum is taken over the symmetric group of permutations on the symbols { l,...,r}. This extends by linearity 
and homogeneity to an operation, also denoted by Alt, on the full tensor algebra T(V). The image Alt(T(V)) is the 
alternating tensor algebra, denoted A(V). This is a vector subspace of T(V), and it inherits the structure of a graded 
vector space from that on T(V). It carries an associative graded product § defined by 

i®s = Alt(*<g> s). 

Although this product differs from the tensor product, the kernel of Alt is precisely the ideal / (again, assuming that K 
has characteristic 0), and there is a canonical isomorphism 

A(V) = A(V). 
Index notation 

Suppose that V has finite dimension n, and that a basis e^ e^ of Vis given, then any alternating tensor t G A r (V) C 
T^V) can be written in index notation as 

t = t ili2 - ir e h (g> e i2 <g) » • • <g) e ir 
where t l \ "' V is completely antisymmetric in its indices. 

The wedge product of two alternating tensors t and s of ranks r and p is given by 



t®s = - — ^— ^ sgn(cr)f-( 1 )-"^W5 iCT ^ 1 ) - iCT ^)e il ® Bi a ® - - - <g) e ir 



The components of this tensor are precisely the skew part of the components of the tensor product s ® t, denoted by 
square brackets on the indices: 

The interior product may also be described in index notation as follows. Let f — ^*o*i-.*r-ibe an antisymmetric 

tensor of rank r. Then, for a G V , i t is an alternating tensor of rank r-1, given by 

a 

(tot)* 1 -**- 1 =r^a J -* f<1 - ir - 1 . 
i=o 

where n is the dimension of V. 
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Applications 

Linear geometry 

The decomposable vectors have geometric interpretations: the bi vector u Av represents the plane spanned by the 
vectors, "weighted" with a number, given by the area of the oriented parallelogram with sides u and v. Analogously, 
the 3 -vector u A v A w represents the spanned 3 -space weighted by the volume of the oriented parallelepiped with 
edges u, v, and w. 

Projective geometry 

k 

Decomposable ^-vectors in A V correspond to weighted ^-dimensional subspaces of V. In particular, the 

Grassmannian of ^-dimensional subspaces of V, denoted Gr 'AV), can be naturally identified with an algebraic 

k 

sub variety of the projective space P(A V). This is called the Pliicker embedding. 
Differential geometry 

The exterior algebra has notable applications in differential geometry, where it is used to define differential forms. A 
differential form at a point of a differentiable manifold is an alternating multilinear form on the tangent space at the 
point. Equivalently, a differential form of degree k is a linear functional on the k-th exterior power of the tangent 
space. As a consequence, the wedge product of multilinear forms defines a natural wedge product for differential 
forms. Differential forms play a major role in diverse areas of differential geometry. 

In particular, the exterior derivative gives the exterior algebra of differential forms on a manifold the structure of a 
differential algebra. The exterior derivative commutes with pullback along smooth mappings between manifolds, and 
it is therefore a natural differential operator. The exterior algebra of differential forms, equipped with the exterior 
derivative, is a differential complex whose cohomology is called the de Rham cohomology of the underlying 
manifold and plays a vital role in the algebraic topology of differentiable manifolds. 

Representation theory 

In representation theory, the exterior algebra is one of the two fundamental Schur functors on the category of vector 
spaces, the other being the symmetric algebra. Together, these constructions are used to generate the irreducible 
representations of the general linear group; see fundamental representation. 

Physics 

The exterior algebra is an archetypal example of a superalgebra, which plays a fundamental role in physical theories 
pertaining to fermions and super symmetry. For a physical discussion, see Grassmann number. For various other 
applications of related ideas to physics, see superspace and supergroup (physics). 

Lie algebra homology 

Let L be a Lie algebra over a field k, then it is possible to define the structure of a chain complex on the exterior 
algebra of L. This is a ^-linear mapping 

d : A P+1 L -> A P L 

defined on decomposable elements by 

1 

d(x 1 A- • -Axp+i) = — — ^2(-l) 3+£ + 1 [x j ,x £ ]Ax 1 A- • -AxjA- • -Ax^A- • -Ax p+1 . 
p + 1 j<£ 

The Jacobi identity holds if and only if 33 = 0, and so this is a necessary and sufficient condition for an 
anticommutative nonassociative algebra L to be a Lie algebra. Moreover, in that case AL is a chain complex with 
boundary operator 3. The homology associated to this complex is the Lie algebra homology. 
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Homological algebra 

The exterior algebra is the main ingredient in the construction of the Koszul complex, a fundamental object in 
homological algebra. 

History 

The exterior algebra was first introduced by Hermann Grassmann in 1844 under the blanket term of 
Ausdehnungslehre, or Theory of Extension} 14 ^ This referred more generally to an algebraic (or axiomatic) theory of 
extended quantities and was one of the early precursors to the modern notion of a vector space. Saint- Venant also 
published similar ideas of exterior calculus for which he claimed priority over Grassmann J 15 ^ 

The algebra itself was built from a set of rules, or axioms, capturing the formal aspects of Cay ley and Sylvester's 
theory of multi vectors. It was thus a calculus, much like the propositional calculus, except focused exclusively on 
the task of formal reasoning in geometrical terms J In particular, this new development allowed for an axiomatic 
characterization of dimension, a property that had previously only been examined from the coordinate point of view. 

ri7i 

The import of this new theory of vectors and multi vectors was lost to mid 19th century mathematicians, until 
being thoroughly vetted by Giuseppe Peano in 1888. Peano's work also remained somewhat obscure until the turn of 
the century, when the subject was unified by members of the French geometry school (notably Henri Poincare, Elie 
Cartan, and Gaston Darboux) who applied Grassmann's ideas to the calculus of differential forms. 

A short while later, Alfred North Whitehead, borrowing from the ideas of Peano and Grassmann, introduced his 
universal algebra. This then paved the way for the 20th century developments of abstract algebra by placing the 
axiomatic notion of an algebraic system on a firm logical footing. 

Notes 

[I] Provided the characteristic is different from 2. 

[2] Grassmann (1844) introduced these as extended algebras (cf. Clifford 1878). He used the word aufiere (literally translated as outer, or 

exterior) only to indicate the produkt he defined, which is nowadays conventionally called exterior product, probably to distinguish it from the 
outer product as defined in modern linear algebra. 

[3] This axiomatization of areas is due to Leopold Kronecker and Karl Weierstrass; see Bourbaki (1989, Historical Note). For a modern 
treatment, see MacLane & Birkhoff (1999, Theorem IX.2.2). For an elementary treatment, see Strang (1993, Chapter 5). 

[4] This definition is a standard one. See, for instance, MacLane & Birkhoff (1999). 

[5] A proof of this can be found in more generality in Bourbaki (1989). 

[6] See Sternberg (1964, §111.6). 

[7] See Bourbaki (1989, 111.7. 1), and MacLane & Birkhoff (1999, Theorem XVI.6.8). More detail on universal properties in general can be found 

in MacLane & Birkhoff (1999, Chapter VI), and throughout the works of Bourbaki. 
[8] Some conventions, particularly in physics, define the wedge product as 

u) A j) — Alt(u; ® rj). 

This convention is not adopted here, but is discussed in connection with alternating tensors. 

[9] Indeed, the exterior algebra of V is the enveloping algebra of the abelian Lie superalgebra structure on V. 

[10] This part of the statement also holds in greater generality if V and W are modules over a commutative ring: That A converts epimorphisms to 
epimorphisms. See Bourbaki (1989, Proposition 3, III.7.2). 

[II] This statement generalizes only to the case where V and W are projective modules over a commutative ring. Otherwise, it is generally not the 
case that A converts monomorphisms to monomorphisms. See Bourbaki (1989, Corollary to Proposition 12, III.7.9). 

[12] Such a filtration also holds for vector bundles, and projective modules over a commutative ring. This is thus more general than the result 

quoted above for direct sums, since not every short exact sequence splits in other abelian categories. 
[13] See Bourbaki (1989, III.7.5) for generalizations. 

[14] Kannenberg (2000) published a translation of Grassmann's work in English; he translated Ausdehnungslehre as Extension Theory. 
[15] J Itard, Biography in Dictionary of Scientific Biography (New York 1970-1990). 

[16] Authors have in the past referred to this calculus variously as the calculus of extension (Whitehead 1898; Forder 1941), or extensive algebra 

(Clifford 1878), and recently as extended vector algebra (Browne 2007). 
[17] Bourbaki 1989, p. 661. 
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Supergroup 

The concept of supergroup is a generalization of that of group. In other words, every group is a supergroup but not 
every supergroup is a group. A supergroup is like a Lie group in that there is a well defined notion of smooth 
function defined on them. However the functions may have even and odd parts. Moreover a supergroup has a super 
Lie algebra which plays a role similar to that of a Lie algebra for Lie groups in that they determine most of the 
representation theory and which is the starting point for classification. 

More formally, a Lie supergroup is a supermanifold G together with a multiplication morphism 
fi : G X G — >■ G , an inversion morphism { m Q _> Q and a unit morphism e : 1 — > G which makes G a 
group object in the category of supermanifolds. This means that, formulated as commutative diagrams, the usual 
associativity and inversion axioms of a group continue to hold. Since every manifold is a super manifold, a Lie 
supergroup generalises the notion of a Lie group. 

There are many possible supergroups. The ones of most interest in theoretical physics are the ones which extend the 
Poincare group or the conformal group. Of particular interest are the orthosymplectic groups Osp(N/M) and the 
superconformal groups SU(N/M). 

An equivalent algebraic approach starts from the observation that a super manifold is determined by its ring of 
supercommutative smooth functions, and that a morphism of super manifolds corresponds one to one with an algebra 
homomorphism between their functions in the opposite direction, i.e that the category of supermanifolds is opposite 
to the category of algebras of smooth graded commutative functions. Reversing all the arrows in the commutative 
diagrams that define a Lie supergroup then shows that functions over the supergroup have the structure of a 
Z 2 -graded Hopf algebra. Likewise the representations of this Hopf algebra turn out to be Z 2 -graded comodules. This 
Hopf algebra gives the global properties of the supergroup. 

There is another related Hopf algebra which is the dual of the previous Hopf algebra. It can be identified with the 
Hopf algebra of graded differential operators at the origin. It only gives the local properties of the symmetries i.e., it 
only gives information about infinitesimal supersymmetry transformations. The representations of this Hopf algebra 
are modules. Like in the non graded case, this Hopf algebra can be described purely algebraically as the universal 
enveloping algebra of the Lie superalgebra. 

In a similar way one can define an affine algebraic supergroup as a group object in the category of super algebraic 
affine varieties. An affine algebraic supergroup has a similar one to one relation to its Hopf algebra of super 
Polynomials. Using the language of schemes, which combines the geometric and algebraic point of view, algebraic 
supergroup schemes can be defined including super Abelian varieties. 
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Superalgebra 

In mathematics and theoretical physics, a superalgebra is a Z 2 -graded algebra. ^ That is, it is an algebra over a 
commutative ring or field with a decomposition into "even" and "odd" pieces and a multiplication operator that 
respects the grading. 

The prefix super- comes from the theory of supersymmetry in theoretical physics. Superalgebras and their 
representations, supermodules, provide an algebraic framework for formulating supersymmetry. The study of such 
objects is sometimes called super linear algebra. Superalgebras also play an important role in related field of 
supergeometry where they enter into the definitions of graded manifolds, supermanifolds and superschemes. 

Formal definition 

Let K be a fixed commutative ring. In most applications, K is a field such as R or C. 
A superalgebra over K is a ^-module A with a direct sum decomposition 

A — Aq® Ai 

together with a bilinear multiplication AxA^A such that 

AiAj C i4.£_|_j 
where the subscripts are read modulo 2. 

A superring, or Z 2 -graded ring, is a superalgebra over the ring of integers Z. 

The elements of A. are said to be homogeneous. The parity of a homogeneous element x, denoted by Ixl, is 0 or 1 
according to whether it is in A Q or A^. Elements of parity 0 are said to be even and those of parity 1 to be odd. If x 
and y are both homogeneous then so is the product xy and \xy\ = \x\ + \y\. 

An associative superalgebra is one whose multiplication is associative and a unital superalgebra is one with a 
multiplicative identity element. The identity element in a unital superalgebra is necessarily even. Unless otherwise 
specified, all superalgebras in this article are assumed to be associative and unital. 

A commutative superalgebra is one which satisfies a graded version of commutativity. Specifically, A is 
commutative if 

yx = (-l)l x H y lxy 
for all homogeneous elements x and y of A. 

Examples 

• Any algebra over a commutative ring K may be regarded as a purely even superalgebra over K; that is, by taking 
A 1 to be trivial. 

• Any Z or N-graded algebra may be regarded as superalgebra by reading the grading modulo 2. This includes 
examples such as tensor algebras and polynomial rings over K. 

• In particular, any exterior algebra over K is a superalgebra. The exterior algebra is the standard example of a 
supercommutative algebra. 

• The symmetric polynomials and alternating polynomials together form a superalgebra, being the even and odd 
parts, respectively. Note that this is a different grading from the grading by degree. 

• Clifford algebras are superalgebras. They are generally noncommutative. 

• The set of all endomorphisms (both even and odd) of a super vector space forms a superalgebra under 
composition. 

• The set of all square supermatrices with entries in K forms a superalgebra denoted by M^(K). This algebra may 
be identified with the algebra of endomorphisms of a free supermodule over K of rank p\q. 
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• Lie superalgebras are a graded analog of Lie algebras. Lie superalgebras are nonunital and nonassociative; 
however, one may construct the analog of a universal enveloping algebra of a Lie superalgebra which is a unital, 
associative superalgebra. 

Further definitions and constructions 

A superalgebra is an algebra with a Z2 grading ("even" and "odd" elements) such that (i) the bracket of two 
generators is always antisymmetric except for two odd elements where it is symmetric and (ii) the Jacobi identities 
are satisfied 

[Ek, {0 fc , 0 6 }] = {[Ei, 0*], O b } + {[Ei, O b ], 0 fe } 
[o k , {o b , O a }] = [{O k , O k }, O a ] + [{0 fc , OJ, OJ 

The first of these three identities says that the 0 form a representation of the ordinary Lie algebra spanned by E 
(Consider the 0 as vectors on which the E act.) The second is equivalent to the first if the Killing form is nonsingular. 
The last identity restricts the possible representations 0 of the ordinary Lie algebra. This relation is the reason that 
not every ordinary Lie algebra can be extended to a superalgebra. 

Even subalgebra 

Let A be a superalgebra over a commutative ring K. The submodule A , consisting of all even elements, is closed 
under multiplication and contains the identity of A and therefore forms a subalgebra of A, naturally called the even 
subalgebra. It forms an ordinary algebra over K. 

The set of all odd elements A 1 is an A Q -bimodule whose scalar multiplication is just multiplication in A. The product 
in A equips A with a bilinear form 

fM : A 1 ® Ao M -> A 0 
such that 

fi(x <g> y) • z = x • fi(y <g> z) 
for all x, y, and z in A . This follows from the associativity of the product in A. 

Grade involution 

There is a canonical involutive automorphism on any superalgebra called the grade involution. It is given on 
homogeneous elements by 

x= {-lpx 

and on arbitrary elements by 

x = Xq — X\ 

where x are the homogeneous parts of x. If A has no 2-torsion (in particular, if 2 is invertible) then the grade 
involution can be used to distinguish the even and odd parts of A: 

Ai = {x G A : x = {-lfx}. 
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Supercommutativity 

The supercommutator on A is the binary operator given by 

[x,y] = xy-(-l)WMyx 
on homogeneous elements. This can be extended to all of A by linearity. Elements x and y of A are said to 
supercommute if [x, y] = 0. 

The supercenter of A is the set of all elements of A which supercommute with all elements of A : 

Z(A) = {a e A : [a, x] = 0 for all x <E A}. 

The supercenter of A is, in general, different than the center of A as an ungraded algebra. A commutative 
superalgebra is one whose supercenter is all of A. 

Super tensor product 

The graded tensor product of two superalgebras may be regarded as a superalgebra with a multiplication rule 
determined by: 

(ai ® 6i)(a 2 ® h) = (-l)^^^^ <g> bfa). 

Generalizations and categorical definition 

One can easily generalize the definition of superalgebras to include superalgebras over a commutative superring. The 
definition given above is then a specialization to the case where the base ring is purely even. 

Let R be a commutative superring. A superalgebra over R is a i?-supermodule A with a /^-bilinear multiplication A x 
A^> A that respects the grading. Bilinearity here means that 

r • (xy) = (r • x)y = (— l)^^^ • y) 

for all homogeneous elements r G R and x, y G A. 

Equivalently, one may define a superalgebra over R as a superring A together with an superring homomorphism R —> 
A whose image lies in the supercenter of A. 

One may also define superalgebras categorically. The category of all /?-supermodules forms a monoidal category 
under the super tensor product with R serving as the unit object. An associative, unital superalgebra over R can then 
be defined as a monoid in the category of /?-supermodules. That is, a superalgebra is an 7?-supermodule A with two 
(even) morphisms 

/i : A <g> A -> A 

rj : R —> A 

for which the usual diagrams commute. 
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Notes 

[1] Kac, Martinez & Zelmanov (2001), p. 3 (http://books.google.com/books ?id=jTCNZz2Tk4cC&pg=PA3&dq="superalgebra"). 
[2] P. van Nieuwenhuizen, Phys. Rep. 68, 189 (1981) 
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Supergravity 

In theoretical physics, supergravity (supergravity theory) is a field theory that combines the principles of 
supersymmetry and general relativity. Together, these imply that, in supergravity, the supersymmetry is a local 
symmetry (in contrast to non-gravitational supersymmetric theories, such as the Minimal Supersymmetric Standard 
Model (MSSM)). Since the generators of supersymmetry (SUSY) are convoluted with the Poincare group to form a 
Super-Poincare algebra it is very natural to see that SuperGravity follows naturally from super symmetry J ^ 

Gravitons 

Like any field theory of gravity, a supergravity theory contains a spin-2 field whose quantum is the graviton. 
Supersymmetry requires the graviton field to have a superpartner. This field has spin 3/2 and its quantum is the 
gravitino. The number of gravitino fields is equal to the number of supersymmetries. Supergravity theories are often 
said to be the only consistent theories of interacting massless spin 3/2 fields. 

History 

Four-dimensional SUGRA 

SUGRA, or SUper GRAvity, was initially proposed as a four-dimensional theory in 1976 by Daniel Z. Freedman, 
Peter van Nieuwenhuizen and Sergio Ferrara at Stony Brook University, but was quickly generalized to many 
different theories in various numbers of dimensions and greater number (N) of supersymmetry charges. Supergravity 
theories with N>1 are usually referred to as extended supergravity (SUEGRA). Some supergravity theories were 
shown to be equivalent to certain higher-dimensional supergravity theories via dimensional reduction (e.g. 1 11 
dimensional supergravity is dimensionally reduced on S7 to N = 8 d = 4 SUGRA). The resulting theories were 
sometimes referred to as Kaluza-Klein theories, as Kaluza and Klein constructed, nearly a century ago, a 
five-dimensional gravitational theory, that when dimensionally reduced on circle, its 4-dimensional non-massive 
modes describe electromagnetism coupled to gravity. 
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mSUGRA 

mSUGRA means minimal SUper GRAvity. The construction of a realistic model of particle interactions within the N 
= 1 supergravity framework where supersymmetry (SUSY) is broken by a super Higgs mechanism was carried out 
by Ali Chamseddine, Richard Arnowitt and Pran Nath in 1982. In these classes of models collectively now known as 
minimal supergravity Grand Unification Theories (mSUGRA GUT), gravity mediates the breaking of SUSY through 
the existence of a hidden sector. mSUGRA naturally generates the Soft SUSY breaking terms which are a 
consequence of the Super Higgs effect. Radiative breaking of electroweak symmetry through Renormalization 
Group Equations (RGEs) follows as an immediate consequence. mSUGRA is one of the most widely investigated 
models of particle physics due to it predictive power requiring only four input parameters and a sign, to determine 
the low energy Phenomenology from the scale of Grand Unification. 

lid: the maximal SUGRA 

One of these supergravities, the 11 -dimensional theory, generated considerable excitement as the first potential 
candidate for the theory of everything. This excitement was built on four pillars, two of which have now been largely 
discredited: 

• Werner Nahm showed that 1 1 dimensions was the largest number of dimensions consistent with a single graviton, 
and that a theory with more dimensions would also have particles with spins greater than 2. These problems are 
avoided in 12 dimensions if two of these dimensions are timelike, as has been often emphasized by Itzhak Bars. 

• In 1981, Ed Witten showed that 1 1 was the smallest number of dimensions that was big enough to contain the 
gauge groups of the Standard Model, namely SU(3) for the strong interactions and SU(2) times U(l) for the 
electroweak interactions. Today many techniques exist to embed the standard model gauge group in supergravity 
in any number of dimensions. For example, in the mid and late 1980s one often used the obligatory gauge 
symmetry in type I and heterotic string theories. In type II string theory they could also be obtained by 
compactifying on certain Calabi-Yau's. Today one may also use D-branes to engineer gauge symmetries. 

• In 1978, Eugene Cremmer, Bernard Julia and Joel Scherk (CJS) of the Ecole Normale Superieure found the 
classical action for an 11 -dimensional supergravity theory. This remains today the only known classical 

11 -dimensional theory with local supersymmetry and no fields of spin higher than two. Other 11 -dimensional 
theories are known that are quantum-mechanically inequivalent to the CJS theory, but classically equivalent (that 
is, they reduce to the CJS theory when one imposes the classical equations of motion). For example, in the mid 
1980s Bernard de Wit and Hermann Nicolai found an alternate theory in D=ll Supergravity with Local SU(8) 
Invariance . This theory, while not manifestly Lorentz-invariant, is in many ways superior to the CJS theory in 
that, for example, it dimensionally-reduces to the 4-dimensional theory without recourse to the classical equations 
of motion. 

• In 1980, Peter G. O. Freund and M. A. Rubin showed that compactification from 11 dimensions preserving all the 
SUSY generators could occur in two ways, leaving only 4 or 7 macroscopic dimensions (the other 7 or 4 being 
compact). Unfortunately, the noncompact dimensions have to form an anti de Sitter space. Today it is understood 
that there are many possible compactifications, but that the Freund-Rubin compactifications are invariant under 
all of the supersymmetry transformations that preserve the action. 

Thus, the first two results appeared to establish 11 dimensions uniquely, the third result appeared to specify the 
theory, and the last result explained why the observed universe appears to be four-dimensional. 

Many of the details of the theory were fleshed out by Peter van Nieuwenhuizen, Sergio Ferrara and Daniel Z. 
Freedman. 
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The end of the SUGRA era 

The initial excitement over 11 -dimensional supergravity soon waned, as various failings were discovered, and 
attempts to repair the model failed as well. Problems included: 

• The compact manifolds which were known at the time and which contained the standard model were not 
compatible with super- symmetry, and could not hold quarks or leptons. One suggestion was to replace the 
compact dimensions with the 7-sphere, with the symmetry group SO(8), or the squashed 7-sphere, with symmetry 
group SO(5) times SU(2). 

• Until recently, the physical neutrinos seen in the real world were believed to be massless, and appeared to be 
left-handed, a phenomenon referred to as the chirality of the Standard Model. It was very difficult to construct a 
chiral fermion from a compactification — the compactified manifold needed to have singularities, but physics 
near singularities did not begin to be understood until the advent of orbifold conformal field theories in the late 
1980s. 

• Supergravity models generically result in an unrealistically large cosmological constant in four dimensions, and 
that constant is difficult to remove, and so require fine-tuning. This is still a problem today. 

• Quantization of the theory led to quantum field theory gauge anomalies rendering the theory inconsistent. In the 
intervening years physicists have learned how to cancel these anomalies. 

Some of these difficulties could be avoided by moving to a 10-dimensional theory involving superstrings. However, 
by moving to 10 dimensions one loses the sense of uniqueness of the 11 -dimensional theory. 

The core breakthrough for the 10-dimensional theory, known as the first superstring revolution, was a demonstration 
by Michael B. Green, John H. Schwarz and David Gross that there are only three supergravity models in 10 
dimensions which have gauge symmetries and in which all of the gauge and gravitational anomalies cancel. These 
were theories built on the groups SO (3 2) and E% X Eg, the direct product of two copies of E . Today we know 
that, using D-branes for example, gauge symmetries can be introduced in other 10-dimensional theories as well. 

The second superstring revolution 

Initial excitement about the lOd theories, and the string theories that provide their quantum completion, died by the 
end of the 1980s. There were too many Calabi-Yaus to compactify on, many more than Yau had estimated, as he 
admitted in December 2005 at the 23rd International Solvay Conference in Physics. None quite gave the standard 
model, but it seemed as though one could get close with enough effort in many distinct ways. Plus no one understood 
the theory beyond the regime of applicability of string perturbation theory. 

There was a comparatively quiet period at the beginning of the 1990s, during which, however, several important 
tools were developed. For example, it became apparent that the various superstring theories were related by "string 
dualities", some of which relate weak string-coupling (i.e. perturbative) physics in one model with strong 
string-coupling (i.e. non-perturbative) in another. 

Then it all changed, in what is known as the second superstring revolution. Joseph Polchinski realized that obscure 
string theory objects, called D-branes, which he had discovered six years earlier, are stringy versions of the p-branes 
that were known in supergravity theories. The treatment of these p-branes was not restricted by string perturbation 
theory; in fact, thanks to supersymmetry, p-branes in supergravity were understood well beyond the limits in which 
string theory was understood. 

Armed with this new nonperturbative tool, Edward Witten and many others were able to show that all of the 
perturbative string theories were descriptions of different states in a single theory which he named M-theory. 
Furthermore he argued that the long wavelength limit of M-theory should be described by the 11 -dimensional 
supergravity that had fallen out of favor with the first superstring revolution 10 years earlier, accompanied by the 2- 
and 5-branes. [*= i.e. when the quantum wavelength associated to objects in the theory are much larger than the size 
of the 1 1th dimension]. 
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Historically, then, supergravity has come "full circle". It is a commonly used framework in understanding features of 
string theories, M-theory and their compactifications to lower spacetime dimensions. 

Relation to superstrings 

Particular 10-dimensional supergravity theories are considered "low energy limits" of the 10-dimensional superstring 
theories; more precisely, these arise as the massless, tree-level approximation of string theories. True effective field 
theories of string theories, rather than truncations, are rarely available. Due to string dualities, the conjectured 
11 -dimensional M-theory is required to have 11 -dimensional supergravity as a "low energy limit". However, this 
doesn't necessarily mean that string theory/M-theory is the only possible UV completion of supergravity; 
supergravity research is useful independent of those relations. 

4D/V= 1 SUGRA 

Before we move on to SUGRA proper, let's recapitulate some important details about general relativity. We have a 
4D differentiable manifold M with a Spin(3,l) principal bundle over it. This principal bundle represents the local 
Lorentz symmetry. In addition, we have a vector bundle T over the manifold with the fiber having four real 
dimensions and transforming as a vector under Spin(3,l). We have an invertible linear map from the tangent bundle 
TM to T. This map is the vierbein. The local Lorentz symmetry has a gauge connection associated with it, the spin 
connection. 

The following discussion will be in superspace notation, as opposed to the component notation, which isn't 
manifestly covariant under SUSY. There are actually many different versions of SUGRA out there which are 
inequivalent in the sense that their actions and constraints upon the torsion tensor are different, but ultimately 
equivalent in that we can always perform a field redefinition of the supervierbeins and spin connection to get from 
one version to another. 

In 4D N=l SUGRA, we have a 414 real differentiable supermanifold M, i.e. we have 4 real bosonic dimensions and 4 

real fermionic dimensions. As in the nonsupersymmetric case, we have a Spin(3,l) principal bundle over M. We 
414 

have an R vector bundle T over M. The fiber of T transforms under the local Lorentz group as follows; the four 
real bosonic dimensions transform as a vector and the four real fermionic dimensions transform as a Majorana 
spinor. This Majorana spinor can be reexpressed as a complex left-handed Weyl spinor and its complex conjugate 
right-handed Weyl spinor (they're not independent of each other). We also have a spin connection as before. 

We will use the following conventions; the spatial (both bosonic and fermionic) indices will be indicated by M, N, ... 
. The bosonic spatial indices will be indicated by \i, v, the left-handed Weyl spatial indices by a, (3,..., and the 
right-handed Weyl spatial indices by q , 0 , ... . The indices for the fiber of T will follow a similar notation, except 

that they will be hatted like this: , a. • See van der Waerden notation for more details. M = (/i, a, a) . The 

supervierbein is denoted by e ^ , and the spin connection by ^mnp- The inverse supervierbein is denoted by 

The supervierbein and spin connection are real in the sense that they satisfy the reality conditions 

ejf 0, Of = ejT (x : 0, 6) where /i* = fi , a * = a , and a* = a and u;(x, 0, 9)* = uj{x, 9 : d) 

The covariant derivative is defined as 

D^f = E%{d N f + u N \J]). 

The covariant exterior derivative as defined over supermanifolds needs to be super graded. This means that every 
time we interchange two fermionic indices, we pick up a +1 sign factor, instead of -1. 

The presence or absence of R symmetries is optional, but if R-symmetry exists, the integrand over the full 
superspace has to have an R-charge of 0 and the integrand over chiral superspace has to have an R-charge of 2. 
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A chiral superfield X is a superfield which satisfies D^X = 0. In order for this constraint to be consistent, we 
require the integrability conditions that = C ^^7^ or some coefficients c. 

Unlike nonSUSY GR, the torsion has to be nonzero, at least with respect to the fermionic directions. Already, even 
in flat superspace, D&e^ + D^e& 7^ 0. In one version of SUGRA (but certainly not the only one), we have the 

following constraints upon the torsion tensor: 

T\ = 0 

a/3 

2f- = 0 
T A . = 0 

if- = 0 
TP- = 0 

Here, Q, is a shorthand notation to mean the index runs over either the left or right Weyl spinors. 

The superdeterminant of the supervierbein, |e| , gives us the volume factor for M. Equivalently, we have the 

volume 414-superform e £=° /\ . . . /\ e £= 3 /\ e <*=l /\ e <*=2 ^ e ^=l ^ g a=2 . 

If we complexify the superdiffeomorphisms, there is a gauge where E~ = 0, E? = Oand £^ = The 
resulting chiral superspace has the coordinates x and 0. 

R is a scalar valued chiral superfield derivable from the supervielbeins and spin connection. If / is any superfield, 
(p 2 - 8R) f is always a chiral superfield. 

The action for a SUGRA theory with chiral superfields X, is given by 

+ c.c. 



S = J d*xd 2 Q2£ [| (D 2 - 8fl) e"* ( X ' X V 3 + W(X) 



where 7^ is the Kahler potential and W is the superpotential, and £ is the chiral volume factor. Unlike the case for 
flat superspace, adding a constant to either the Kahler or superpotential is now physical. A constant shift to the 
Kahler potential changes the effective Planck constant, while a constant shift to the superpotential changes the 
effective cosmological constant. As the effective Planck constant now depends upon the value of the chiral 
superfield X, we need to rescale the supervierbeins (a field redefinition) to get a constant Planck constant. This is 
called the Einstein frame. 
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Higher-dimensional SUGRA 

See the article higher-dimensional supergravity for more details. 
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Quantum statistical mechanics 

Quantum statistical mechanics is the study of statistical ensembles of quantum mechanical systems. A statistical 
ensemble is described by a density operator S, which is a non-negative, self-adjoint, trace-class operator of trace 1 on 
the Hilbert space H describing the quantum system. This can be shown under various mathematical formalisms for 
quantum mechanics. One such formalism is provided by quantum logic. 

Expectation 

From classical probability theory, we know that the expectation of a random variable X is completely determined by 
its distribution by 

Exppf) = / XdD x {X) 

assuming, of course, that the random variable is integrable or that the random variable is non-negative. Similarly, let 
A be an observable of a quantum mechanical system. A is given by a densely defined self-adjoint operator on H. The 
spectral measure of A defined by 

E A (U) = [ AdE(A), 
Ju 

uniquely determines A and conversely, is uniquely determined by A. is a boolean homomorphism from the Borel 
subsets of R into the lattice Q of self-adjoint projections of H. In analogy with probability theory, given a state S, we 
introduce the distribution of A under S which is the probability measure defined on the Borel subsets of R by 

D A (U) = Tr(E A (U)S). 

Similarly, the expected value of A is defined in terms of the probability distribution by 

Exp (4) = [ XdB A (X). 

Note that this expectation is relative to the mixed state S which is used in the definition of D^. 

Remark. For technical reasons, one needs to consider separately the positive and negative parts of A defined by the 
Borel functional calculus for unbounded operators. 

One can easily show: 

Exp(A) = Tr(AS) = Tr{SA). 

Note that if S is a pure state corresponding to the vector ip, 

Exp(^) = (1>\A\1>). 
Von Neumann entropy 

Of particular significance for describing randomness of a state is the von Neumann entropy of S formally defined by 
H(5) = -Tr(51og 2 5). 

Actually, the operator S log 2 S is not necessarily trace-class. However, if S is a non-negative self-adjoint operator not 
of trace class we define Tr(S) = +00. Also note that any density operator S can be diagonalized, that it can be 
represented in some orthonormal basis by a (possibly infinite) matrix of the form 



Ai 


0 •• 


• 0 


0 


A 2 •• 


• 0 


0 


0 ■■ 


■ K 
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and we define 

H(5) = -J]A i log 2 A i . 

i 

The convention is that 0 log 2 0 = 0, since an event with probability zero should not contribute to the entropy. This 

value is an extended real number (that is in [0, <»]) and this is clearly a unitary invariant of S. 

Remark. It is indeed possible that H(S) = +°« for some density operator S. In fact The the diagonal matrix 

n o 
0 



T = 



2(log 2 2)2 

0 



0 

1 



0 



3(log 2 3)2 

0 



n(log 2 n) 2 



T is non-negative trace class and one can show T log 2 T is not trace-class. 
Theorem. Entropy is a unitary invariant. 

In analogy with classical entropy (notice the similarity in the definitions), H(S) measures the amount of randomness 
in the state S. The more dispersed the eigenvalues are, the larger the system entropy. For a system in which the space 
H is finite-dimensional, entropy is maximized for the states S which in diagonal form have the representation 

r ± 0 ••• 0" 



0 I 



0 



0 0 ••• i 

n. 

For such an S, H(S) = log 2 n. The state S is called the maximally mixed state. 
Recall that a pure state is one of the form 

s = |v) (VI, 

for a vector of norm 1 . 

Theorem. R(S) = 0 if and only if S is a pure state. 

For S is a pure state if and only if its diagonal form has exactly one non-zero entry which is a 1 . 
Entropy can be used as a measure of quantum entanglement. 



Gibbs canonical ensemble 

Consider an ensemble of systems described by a Hamiltonian H with average energy E. If H has pure-point spectrum 
and the eigenvalues E n of H go to + °o sufficiently fast, e will be a non-negative trace-class operator for every 
positive r. 

The Gibbs canonical ensemble is described by the state 

where (3 is such that the ensemble average of energy satisfies 
Tt{SH) = E 

,and 

Tr(e-H = 2>-^ 

n 

is the quantum mechanical version of the canonical partition function. The probability that a system chosen at 
random from the ensemble will be in a state corresponding to energy eigenvalue is 
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Under certain conditions, the Gibbs canonical ensemble maximizes the von Neumann entropy of the state subject to 
the energy conservation requirement. 
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Quantum thermodynamics 

In the physical sciences, quantum thermodynamics is the study of heat and work dynamics in quantum systems. 
Approximately, quantum thermodynamics attempts to combine thermodynamics and quantum mechanics into a 
coherent whole. The essential point at which "quantum mechanics" began was when, in 1900, Max Planck outlined 
the "quantum hypothesis", i.e. that the energy of atomic systems can be quantized, as based on the first two laws of 
thermodynamics as described by Rudolf Clausius (1865) and Ludwig Boltzmann (1877).^ ^ See the history of 
quantum mechanics for a more detailed outline. 

Overview 

A central objective in quantum thermodynamics is the quantitative and qualitative determination of the laws of 
thermodynamics at the quantum level in which uncertainty and probability begin to take effect. A fundamental 
question is: what remains of thermodynamics if one goes to the extreme limit of small quantum systems having a 
few degrees of freedom? If thermodynamics applies at this level, does the second law of thermodynamics remain 
unchanged, or is there a more universal formulation than the many existing formulations, such as: the entropy of a 
closed system cannot decrease; heat flows from high to low temperature; systems evolve towards minimum potential 
energy wells; energy tends to dissipate; and so on. 
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External links 

• Quantum Thermodynamics and the Gibbs Paradox (http://staff.science.uva.nl/~nieuwenh/QL2L.html) 

• Quantum Thermodynamics (http://www.chaos.org.uk/~eddy/physics/heat.html) 

• On the Classical Limit of Quantum Thermodynamics in Finite Time (http://www.fh.huji.ac.il/~ronnie/ 
Paper s/ge va92.pdf) [PDF-format] 

• Quantum Thermodynamics (http://www.quantumthermodynamics.org) - list of good related articles 

Supertheory 

The theory of everything (TOE) is a putative theory of theoretical physics that fully explains and links together all 
known physical phenomena, and, ideally, has predictive power for the outcome of any experiment that could be 
carried out in principle. 

Initially, the term was used with an ironic connotation to refer to various overgeneralized theories. For example, a 
great-grandfather of Ijon Tichy — a character from a cycle of Stanislaw Lem's science fiction stories of the 
1960s — was known to work on the "General Theory of Everything". Physicist John Ellis ^ claims to have introduced 
the term into the technical literature in an article in Nature in 1986. Over time, the term stuck in popularizations of 
quantum physics to describe a theory that would unify or explain through a single model the theories of all 
fundamental interactions and of all particles of nature: general relativity for gravitation, and the standard model of 
elementary particle physics - which includes quantum mechanics - for electromagnetism, the two nuclear 
interactions, and the known elementary particles. 

There have been many theories of everything proposed by theoretical physicists over the twentieth century, but none 
have been confirmed experimentally. The primary problem in producing a TOE is that the accepted theories of 
quantum mechanics and general relativity are hard to combine. Their mutual incompatibility makes their unification 
a difficult task. The combination is one of the unsolved problems in physics. 

Based on theoretical holographic principle arguments from the 1990s, many physicists believe that 11 -dimensional 
M-theory, which is described in many sectors by matrix string theory, in many other sectors by perturbative string 
theory, is the complete theory of everything. However, there is no widespread consensus on this issue, because 
M-theory is not a completed theory but rather an approach for producing one. 

Historical antecedents 
Ancient Greece to Einstein 

Archimedes was possibly the first scientists to describe nature with axioms (or principles) and then to deduce new 
results from them. The putative theory of everything is expected to achieve the deduction of all phenomena from 
basic axioms. 

Since ancient Greek times, philosophers have speculated that the apparent diversity of appearances conceals an 
underlying unity, and thus that the list of forces might be short, indeed might contain only a single entry. For 
example, the mechanical philosophy of the 17th century posited that all forces could be ultimately reduced to contact 
forces between tiny solid particles. This was abandoned after the acceptance of Isaac Newton's long-distance force 
of gravity; but at the same time, Newton's work in his Principia provided the first dramatic empirical evidence for 
the unification of apparently distinct forces: Galileo's work on terrestrial gravity, Kepler's laws of planetary motion, 
and the phenomenon of tides were all quantitatively explained by a single law of universal gravitation. 

Building on these results, Laplace famously suggested that a sufficiently powerful intellect could, if it knew the 
position and velocity of every particle at a given time, along with the laws of nature, calculate the position of any 
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particle at any other time: 

An intellect which at a certain moment would know all forces that set nature in motion, and all positions of all 
items of which nature is composed, if this intellect were also vast enough to submit these data to analysis, it 
would embrace in a single formula the movements of the greatest bodies of the universe and those of the 
tiniest atom; for such an intellect nothing would be uncertain and the future just like the past would be present 
before its eyes. 

— Essai philosophique sur les probabilites, Introduction. 1814 

Modern quantum mechanics implies that uncertainty is inescapable, and thus that Laplace's vision cannot be correct. 
A theory of everything this must include quantum mechanics and gravitation. 

In 1820, Hans Christian 0rsted discovered a connection between electricity and magnetism, triggering decades of 
work that culminated in James Clerk Maxwell's theory of electromagnetism. Also during the 19th and early 20th 
centuries, it gradually became apparent that many common examples of forces — contact forces, elasticity, viscosity, 
friction, pressure — resulted from electrical interactions between the smallest particles of matter. 

In the late 1920s, the new quantum mechanics showed that the chemical bonds between atoms were examples of 
(quantum) electrical forces, justifying Dirac's boast that "the underlying physical laws necessary for the 
mathematical theory of a large part of physics and the whole of chemistry are thus completely known" ^ 

Attempts to unify gravity with electromagnetism date back at least to Michael Faraday's experiments of 1 849-50 
After Albert Einstein's theory of gravity (general relativity) was published in 1915, the search for a unified field 
theory combining gravity with electromagnetism began in earnest. At the time, it seemed plausible that no other 
fundamental forces exist. Prominent contributors were Gunnar Nordstrom, Hermann Weyl, Arthur Eddington, 
Theodor Kaluza, Oskar Klein, and most notably, many attempts by Einstein and his collaborators. In his last years, 
Albert Einstein was intensely occupied in finding such a unifying theory. None of these attempts was successful. ^ 

The nuclear interactions 

In the twentieth century, the search for a unifying theory was interrupted by the discovery of the strong and weak 
nuclear forces (or interactions), which differ both from gravity and from electromagnetism. A further hurdle was the 
acceptance that quantum mechanics had to be incorporated from the start, rather than emerging as a consequence of a 
deterministic unified theory, as Einstein had hoped. 

Gravity and electromagnetism could always peacefully coexist as entries in a list of classical forces, but for many 
years it seemed that gravity could not even be incorporated into the quantum framework, let alone unified with the 
other fundamental forces. For this reason, work on unification, for much of the twentieth century, focused on 
understanding the three "quantum" forces: electromagnetism and the weak and strong forces. The first two were 
combined in 1967-68 by Sheldon Glashow, Steven Weinberg, and Abdus Salam into the "electro weak" force. 
However, while the strong and electroweak forces peacefully coexist in the Standard Model of particle physics, they 
remain distinct. 

Electroweak unification is a broken symmetry: the electromagnetic and weak forces appear distinct at low energies 

2 2 

because the particles carrying the weak force, the W and Z bosons, with masses of 80.4 GeV/c and 91.2 GeV/c , 
whereas the photon, which carries the electromagnetic force, is massless. At higher energies Ws and Zs can be 
created easily and the unified nature of the force becomes apparent. 

Several Grand Unified Theories (GUTs) have been proposed to unify electromagnetism and the weak and strong 
forces. Grand unification is expected to set in at energies of the order of 10 16 GeV, far greater than could be reached 
by any possible Earth-based particle accelerator. Although the simplest GUTs have been experimentally ruled out, 
the general idea, especially when linked with supersymmetry, remains a favorite candidate in the theoretical physics 
community. 
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Modern physics 

The conventional pattern of theories 

A Theory of Everything would unify all the fundamental interactions of nature: gravitation, strong interaction, weak 
interaction, and electromagnetism. Because the weak interaction can transform elementary particles from one kind 
into another, the TOE should also yield a deep understanding of the various different kinds of possible particles. The 
usual assumed path of theories is given in the following graph, where each unification step leads one level higher: 

Theory of 
Everything 

Gravitation Electronuclear force 

(GUT) 

Strong Electroweak 
interaction force 
SU(3) SU(2) x U(l) 

Weak Electromagnetism 
interaction U(l) 
SU(2) 

Electricity Magnetism 

In this graph, electroweak unification occurs at around 100 GeV, grand unification is predicted to occur at 10 16 GeV, 

19 

and unification of the GUT force with gravity is expected at the Planck energy, roughly 10 GeV. 

In addition to the forces listed in the graph, a TOE must also explain the status of at least to candidate forces 
suggested by modern cosmology: an inflationary force and dark energy. Furthermore, cosmological experiments also 
suggest the existence of dark matter, supposedly composed of fundamental particles outside the scheme of the 
standard model. However, the existence of these forces and particles has not been proven yet. 

It may seem premature to be searching for a TOE when there is as yet no direct evidence for an electronuclear force, 
and while in any case there are many different proposed GUTs for this force. Nevertheless, most physicists believe 
that a GUT is possible, mainly due to the past history of convergence towards a single theory. Super symmetric 
GUTs seem plausible not only for their theoretical "beauty", but because they naturally produce large quantities of 
dark matter, and the inflationary force may be related to GUT physics (although it does not seem to form an 
inevitable part of the theory). Yet GUTs are clearly not the final answer. Both the current standard model and all 
proposed GUTs are quantum field theories which require the problematic technique of renormalization to yield 
sensible answers. This is usually regarded as a sign that these are only effective field theories, omitting crucial 
phenomena relevant only at very high energies. Furthermore, the inconsistency between quantum mechanics and 
general relativity implies that one or both of these must be replaced by a theory incorporating quantum gravity. 



String theory and M-theory 

The mainstream theory of everything at the moment is superstring theory / M-theory. These theories attempt to deal 
with the renormalization problem by setting up some lower bound on the length scales possible. 

String theories and supergravity (both believed to be limiting cases of the yet-to-be-defined M-theory) suppose that 
the universe actually has more dimensions than the easily observed three of space and one of time. The motivation 
behind this approach began with the Kaluza-Klein theory in which it was noted that applying general relativity to a 
five dimensional universe (with the usual four dimensions plus one small curled-up dimension) yields the equivalent 
of the usual general relativity in four dimensions together with Maxwell's equations (electromagnetism, also in four 
dimensions). This has led to efforts to work with theories with large number of dimensions in the hopes that this 
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would produce equations that are similar to known laws of physics. The notion of extra dimensions also helps to 
resolve the hierarchy problem, which is the question of why gravity is so much weaker than any other force. The 
common answer involves gravity leaking into the extra dimensions in ways that the other forces do not. 

In the late 1990s, it was noted that one problem with several of the candidates for theories of everything (but 

particularly string theory) was that they did not constrain the characteristics of the predicted universe. For example, 

many theories of quantum gravity can create universes with arbitrary numbers of dimensions or with arbitrary 

cosmological constants. Even the "standard" ten-dimensional string theory allows the "curled up" dimensions to be 

500 

compactified in an enormous number of different ways (one estimate is 10 ) each of which corresponds to a 
different collection of fundamental particles and low-energy forces. This array of theories is known as the string 
theory landscape. 

A speculative solution is that many or all of these possibilities are realised in one or another of a huge number of 
universes, but that only a small number of them are habitable, and hence the fundamental constants of the universe 
are ultimately the result of the anthropic principle rather than a consequence of the theory of everything. This 
anthropic approach is often criticised in that, because the theory is flexible enough to encompass almost any 
observation, it cannot make useful (as in original, falsifiable, and verifiable) predictions. In this view, string theory 
would be considered a pseudoscience, where an unfalsifiable theory is constantly adapted to fit the experimental 
results. 

Loop quantum gravity 

Current research on loop quantum gravity may eventually play a fundamental role in a TOE, but that is not its 
primary aim. Loop quantum gravity is facing difficulties in incorporating electromagnetism and the nuclear 
interactions. 

Other attempts 

Any TOE must include general relativity and the standard model of particle physics. Outside the previously 
mentioned attempts, the best-known one is Garrett Lisi's E8 proposal. 

Present status 

At present, no convincing candidate for a TOE is available. Most particle physicists tend to state that the outcome of 
the ongoing experiments at the large particle accelerators, the LCH and the Tevatron, are needed in order to provide 
theoreticians with precise input for such a theory. 

Arguments against a theory of everything 
Godel's incompleteness theorem 

A small number of scientists claim that Godel's incompleteness theorem proves that any attempt to construct a TOE 
is bound to fail. Godel's theorem, informally stated, asserts that any formal theory expressive enough for elementary 
arithmetical facts to be expressed and strong enough for them to be proved is either inconsistent (both a statement 
and its denial can be derived from its axioms) or incomplete, in the sense that there is a true statement about natural 
numbers that can't be derived in the formal theory. In his 1966 book The Relevance of Physics, Stanley Jaki pointed 
out that, because any "theory of everything" will certainly be a consistent non-trivial mathematical theory, it must be 
incomplete. He claims that this dooms searches for a deterministic theory of everything. 1 In a later reflection, Jaki 
states that it is wrong to say that a final theory is impossible, but rather that "when it is on hand one cannot know 
rigorously that it is a final theory." ^ 

Freeman Dyson has stated that 
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Godel's theorem implies that pure mathematics is inexhaustible. No matter how many problems we solve, there will always be other problems 
that cannot be solved within the existing rules. [...] Because of Godel's theorem, physics is inexhaustible too. The laws of physics are a finite 
set of rules, and include the rules for doing mathematics, so that Godel's theorem applies to them. 

— NYRB, May 13,2004 

Stephen Hawking was originally a believer in the Theory of Everything but, after considering Godel's Theorem, 
concluded that one was not obtainable. 

Some people will be very disappointed if there is not an ultimate theory, that can be formulated as a finite number of principles. I used to 
belong to that camp, but I have changed my mind. 

— Godel and the end of physics [1 1 \ July 20, 2002 

Jurgen Schmidhuber (1997) has argued against this view; he points out that Godel's theorems are irrelevant for 
computable physics. In 2000, Schmidhuber explicitly constructed limit-computable, deterministic universes 
whose pseudo-randomness based on undecidable, Godel-like halting problems is extremely hard to detect but does 
not at all prevent formal TOEs describable by very few bits of information J ^ 

Related critique was offered by Solomon FefermanJ 15 ^ among others. Douglas S. Robertson offers Conway's game 
of life as an example: ^ The underlying rules are simple and complete, but there are formally undecidable questions 
about the game's behaviors. Analogously, it may (or may not) be possible to completely state the underlying rules of 
physics with a finite number of well-defined laws, but there is little doubt that there are questions about the behavior 
of physical systems which are formally undecidable on the basis of those underlying laws. 

Since most physicists would consider the statement of the underlying rules to suffice as the definition of a "theory of 
everything", these researchers argue that Godel's Theorem does not mean that a TOE cannot exist. On the other 
hand, the physicists invoking Godel's Theorem appear, at least in some cases, to be referring not to the underlying 
rules, but to the understandability of the behavior of all physical systems, as when Hawking mentions arranging 
blocks into rectangles, turning the computation of prime numbers into a physical question. 1 This definitional 
discrepancy may explain some of the disagreement among researchers. 

Another approach to working with the limits of logic implied by Godel's incompleteness theorems is to abandon the 

n 8i 

attempt to model reality using a formal system altogether. Process Physics is a notable example of a candidate 
TOE that takes this approach, where reality is modeled using self-organizing (purely semantic) information. 

Fundamental limits in accuracy 

No physical theory to date is believed to be precisely accurate. Instead, physics has proceeded by a series of 
"successive approximations" allowing more and more accurate predictions over a wider and wider range of 
phenomena. Some physicists believe that it is therefore a mistake to confuse theoretical models with the true nature 
of reality, and hold that the series of approximations will never terminate in the "truth". Einstein himself expressed 
this view on occasions J 1 9 ^ On this view, we may reasonably hope for a theory of everything which self-consistently 
incorporates all currently known forces, but should not expect it to be the final answer. 

On the other hand it is often claimed that, despite the apparently ever-increasing complexity of the mathematics of 
each new theory, in a deep sense associated with their underlying gauge symmetry and the number of fundamental 
physical constants, the theories are becoming simpler. If so, the process of simplification cannot continue 
indefinitely. 
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Lack of fundamental laws 

There is a philosophical debate within the physics community as to whether a theory of everything deserves to be 
called the fundamental law of the universe P 0 ^ One view is the hard reductionist position that the TOE is the 
fundamental law and that all other theories that apply within the universe are a consequence of the TOE. Another 
view is that emergent laws (called "free floating laws" by Steven Weinberg), which govern the behavior of complex 
systems, should be seen as equally fundamental. Examples are the second law of thermodynamics and the theory of 
natural selection. The point being that, although in our universe these laws describe systems whose behaviour could 
("in principle") be predicted from a TOE, they would also hold in universes with different low-level laws, subject 
only to some very general conditions. Therefore it is of no help, even in principle, to invoke low-level laws when 
discussing the behavior of complex systems. Some argue that this attitude would violate Occam's Razor if a 
completely valid TOE were formulated. It is not clear that there is any point at issue in these debates (e.g., between 
Steven Weinberg and Philip Anderson) other than the right to apply the high-status word "fundamental" to their 
respective subjects of interest. 

Impossibility of being "of everything" 

Although the name "theory of everything" suggests the determinism of Laplace's quotation, this gives a very 
misleading impression. Determinism is frustrated by the probabilistic nature of quantum mechanical predictions, by 
the extreme sensitivity to initial conditions that leads to mathematical chaos, and by the extreme mathematical 
difficulty of applying the theory. Thus, although the current standard model of particle physics "in principle" predicts 
all known non-gravitational phenomena, in practice only a few quantitative results have been derived from the full 
theory (e.g., the masses of some of the simplest hadrons), and these results (especially the particle masses which are 
most relevant for low-energy physics) are less accurate than existing experimental measurements. The true TOE 
would almost certainly be even harder to apply. The main motive for seeking a TOE, apart from the pure intellectual 
satisfaction of completing a centuries-long quest, is that all prior successful unifications have predicted new 
phenomena, some of which (e.g., electrical generators) have proved of great practical importance. As in other cases 
of theory reduction, the TOE would also allow us to confidently define the domain of validity and residual error of 
low-energy approximations to the full theory which could be used for practical calculations. 

Theory of everything and philosophy 

The philosophical implication of a physical TOE are frequently debated. For example, if physicalism is true, a 
physical TOE will coincide with a philosophical theory of everything. Some philosophers (Aristotle, Plato, Hegel, 
Whitehead, et al.) have attempted to construct all-encompassing systems. Others are highly dubious about the very 
possibility of such an exercise. 

Stephen Hawking wrote in A Brief History of Time that even if we had a TOE, it would necessarily be a set of 
equations. He wrote, "What is it that breathes fire into the equations and makes a universe for them to describe?" P ^ 

While on his deathbed, Einstein still explored equations that he imagined to be candidates of a unified theory. Of 
course, the question would then be "why those equations?" One possible solution might be to adopt the point of view 
of ultimate ensemble, or modal realism, and say that those equations are not unique. Others doubt that the theory of 
everything will be in the form of equations at all. 
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Quantum algebra 

Quantum algebra is one of the top-level mathematics categories used by the arXiv. 
Subjects include: 

• Quantum groups 

• Skein theories 

• Operadic algebra 

• Diagrammatic algebra 

• Quantum field theory 

External links 

• Quantum algebra at arxiv.org ^ 
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Lie algebra 

In mathematics, a Lie algebra (pronounced /' li :/ ("lee"), not /'lai/ ("lye")) is an algebraic structure whose main use is 
in studying geometric objects such as Lie groups and differentiable manifolds. Lie algebras were introduced to study 
the concept of infinitesimal transformations. The term "Lie algebra" (after Sophus Lie) was introduced by Hermann 
Weyl in the 1930s. In older texts, the name "infinitesimal group" is used. 

Definition and first properties 

A Lie algebra is a vector space gover some field F together with a binary operation [•, •] 

M : S X fl ->fl 

called the Lie bracket, which satisfies the following axioms: 

• Bilinearity: 

[ax + by,z]= a[x, z] + %, z], [z, ax + by] = a[z, x] + b[z, y] 
for all scalars a, b in F and all elements x, y, z in 0 . 

• Alternating on Q : 

[x, x] = 0 

for all x in 0. This implies anticommutativity, or skew-symmetry (in fact the conditions are equivalent for any 
Lie algebra over any field whose characteristic is not 2): 

for all elements x, y in 0 . 

• The Jacobi identity: 
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fc, [y, z\] + [y, [z, x\] + [z, [x, y]] = 0 

for all x, y, z in 0 . 

For any associative algebra A with multiplication * , one can construct a Lie algebra L(A). As a vector space, L(A) is 
the same as A. The Lie bracket of two elements of L(A) is defined to be their commutator in A: 

[a, b] = a * b — b * a. 

The associativity of the multiplication * in A implies the Jacobi identity of the commutator in L{A). In particular, the 
associative algebra of n x n matrices over a field F gives rise to the general linear Lie algebra g[ n ( J F).The 
associative algebra A is called an enveloping algebra of the Lie algebra L(A). It is known that every Lie algebra can 
be embedded into one that arises from an associative algebra in this fashion. See universal enveloping algebra. 

Homomorphisms, subalgebras, and ideals 

The Lie bracket is not an associative operation in general, meaning that [[x, y], zjneed not equal [x, [y, z]]. 
Nonetheless, much of the terminology that was developed in the theory of associative rings or associative algebras is 
commonly applied to Lie algebras. A subspace f) C fjthat is closed under the Lie bracket is called a Lie 
subalgebra. If a subspace J C q satisfies a stronger condition that 

then / is called an ideal in the Lie algebra 0.^ A Lie algebra in which the commutator is not identically zero and 
which has no proper ideals is called simple. A homomorphism between two Lie algebras (over the same ground 
field) is a linear map that is compatible with the commutators: 

/:0^0', f([x,y]) = [f(x),f(y)], 

for all elements x and y in 0. As in the theory of associative rings, ideals are precisely the kernels of 
homomorphisms, given a Lie algebra 0and an ideal / in it, one constructs the factor algebra q/I , and the first 
isomorphism theorem holds for Lie algebras. Given two Lie algebras 0and g* , their direct sum is the vector space 
0 © fl' consisting of the pairs (x, x ! \ x £ 0, x ! G 0 ; , with the operation 

[(x,x') 5 (y,y')] = (kvL W,y'])i *>y£& *',y' £ a'- 
Examples 

• Any vector space V endowed with the identically zero Lie bracket becomes a Lie algebra. Such Lie algebras are 
called abelian, cf. below. Any one-dimensional Lie algebra over a field is abelian, by the antisymmetry of the Lie 
bracket. 

• The three-dimensional Euclidean space R with the Lie bracket given by the cross product of vectors becomes a 
three-dimensional Lie algebra. 

• The Heisenberg algebra is a three-dimensional Lie algebra with generators (see also the definition at Generating 
set): 

(010\ /000\ /001\ 

000, y=001, z=000, 
000/ \000/ \000/ 

whose commutation relations are 

It is explicitly exhibited as the space of 3x3 strictly upper- triangular matrices. 

• The subspace of the general linear Lie algebra gl n (F) consisting of matrices of trace zero is a subalgebra, ^ the 
special linear Lie algebra, denoted sl n (F). 
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• Any Lie group G defines an associated real Lie algebra q = Lie(G) . The definition in general is somewhat 
technical, but in the case of real matrix groups, it can be formulated via the exponential map, or the matrix 
exponent. The Lie algebra 0 consists of those matrices X for which 

exp(£X) G G 

for all real numbers t. The Lie bracket of 0is given by the commutator of matrices. As a concrete example, 
consider the special linear group SL(n,R), consisting of all nxn matrices with real entries and determinant 1. 
This is a matrix Lie group, and its Lie algebra consists of all n x n matrices with real entries and trace 0. 

• The real vector space of dXlnxn skew-hermitian matrices is closed under the commutator and forms a real Lie 
algebra denoted u(n) . This is the Lie algebra of the unitary group U(n). 

• An important class of infinite-dimensional real Lie algebras arises in differential topology. The space of smooth 
vector fields on a differentiable manifold M forms a Lie algebra, where the Lie bracket is defined to be the 
commutator of vector fields. One way of expressing the Lie bracket is through the formalism of Lie derivatives, 
which identifies a vector field X with a first order partial differential operator L acting on smooth functions by 

letting L if) be the directional derivative of the function /in the direction of X. The Lie bracket [X,Y] of two 

x 

vector fields is the vector field defined through its action on functions by the formula: 

L[X,Y]f = Lx(Lyf) - Ly(L X f). 

This Lie algebra is related to the pseudogroup of diffeomorphisms of M. 

• The commutation relations between the x, y, and z components of the angular momentum operator in quantum 
mechanics form a representation of a complex three-dimensional Lie algebra, which is the complexification of the 
Lie algebra so(3) of the three-dimensional rotation group: 

[L x , L y ] = ihL z 
[Lyj L z ] = ihL x 
[L z , LJ = ihLy 

• Kac-Moody algebra is an example of an infinite-dimensional Lie algebra. 

Structure theory and classification 

Every finite-dimensional real or complex Lie algebra has a faithful representation by matrices (Ado's theorem). Lie's 
fundamental theorems describe a relation between Lie groups and Lie algebras. In particular, any Lie group gives 
rise to a canonically determined Lie algebra (concretely, the tangent space at the identity), and conversely, for any 
Lie algebra there is a corresponding connected Lie group (Lie's third theorem). This Lie group is not determined 
uniquely, however, any two connected Lie groups with the same Lie algebra are locally isomorphic, and in 
particular, have the same universal cover. For instance, the special orthogonal group SO(3) and the special unitary 
group SU(2) give rise to the same Lie algebra, which is isomorphic to R with the cross-product, and SU(2) is a 
simply-connected twofold cover of SO(3). Real and complex Lie algebras can be classified to some extent, and this 
is often an important step toward the classification of Lie groups. 

Abelian, nilpotent, and solvable 

Analogously to abelian, nilpotent, and solvable groups, defined in terms of the derived subgroups, one can define 
abelian, nilpotent, and solvable Lie algebras. 

A Lie algebra 0is abelian if the Lie bracket vanishes, i.e. [x,y] = 0, for all x and y in 0. Abelian Lie algebras 
correspond to commutative (or abelian) connected Lie groups such as vector spaces J{ n or tori 7™, and are all of 
the form t™, meaning an ^-dimensional vector space with the trivial Lie bracket. 

A more general class of Lie algebras is defined by the vanishing of all commutators of given length. A Lie algebra 0 
is nilpotent if the lower central series 
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0 > [0>0] > [[0,0], 0] > [[[0 3 0]»0] 3 0] > 

becomes zero eventually. By Engel's theorem, a Lie algebra is nilpotent if and only if for every u in 0the adjoint 
endomorphism 

ad(it) : g — > g, did(u)v = [it, v] 
is nilpotent. 

More generally still, a Lie algebra 0is said to be solvable if the derived series: 

0> [0,0] > [[0,0], [0,0]] > [[[0,0], [0,0]], [[0,0], [0,0]]] > 

becomes zero eventually. 

Every finite-dimensional Lie algebra has a unique maximal solvable ideal, called its radical. Under the Lie 
correspondence, nilpotent (respectively, solvable) connected Lie groups correspond to nilpotent (respectively, 
solvable) Lie algebras. 

Simple and semisimple 

A Lie algebra is "simple" if it has no non-trivial ideals and is not abelian. A Lie algebra 0is called semisimple if its 
radical is zero. Equivalently, 0is semisimple if it does not contain any non-zero abelian ideals. In particular, a 
simple Lie algebra is semisimple. Conversely, it can be proven that any semisimple Lie algebra is the direct sum of 
its minimal ideals, which are canonically determined simple Lie algebras. 

The concept of semisimplicity for Lie algebras is closely related with the complete reducibility of their 
representations. When the ground field F has characteristic zero, semisimplicity of a Lie algebra 0over F is 
equivalent to the complete reducibility of all finite-dimensional representations of 0-An early proof of this 
statement proceeded via connection with compact groups (Weyl's unitary trick), but later entirely algebraic proofs 
were found. 

Classification 

In many ways, the classes of semisimple and solvable Lie algebras are at the opposite ends of the full spectrum of 
the Lie algebras. The Levi decomposition expresses an arbitrary Lie algebra as a semidirect sum of its solvable 
radical and a semisimple Lie algebra, almost in a canonical way. Semisimple Lie algebras over an algebraically 
closed field have been completely classified through their root systems. The classification of solvable Lie algebras is 
a 'wild' problem, and cannot be accomplished in general. 

Cartan's criterion gives conditions for a Lie algebra to be nilpotent, solvable, or semisimple. It is based on the notion 
of the Killing form, a symmetric bilinear form on 0 defined by the formula 

K(u,v) = tr(ad(w)ad(v)), 
where tr denotes the trace of a linear operator. A Lie algebra 0is semisimple if and only if the Killing form is 
nondegenerate. A Lie algebra 0is solvable if and only if K(q, [0,0]) =0. 

Relation to Lie groups 

Although Lie algebras are often studied in their own right, historically they arose as a means to study Lie groups. 
Given a Lie group, a Lie algebra can be associated to it either by endowing the tangent space to the identity with the 
differential of the adjoint map, or by considering the left-invariant vector fields as mentioned in the examples. This 
association is functorial, meaning that homomorphisms of Lie groups lift to homomorphisms of Lie algebras, and 
various properties are satisfied by this lifting: it commutes with composition, it maps Lie subgroups, kernels, 
quotients and cokernels of Lie groups to subalgebras, kernels, quotients and cokernels of Lie algebras, respectively. 

The functor which takes each Lie group to its Lie algebra and each homomorphism to its differential is a faithful and 
exact functor. This functor is not invertible; different Lie groups may have the same Lie algebra, for example SO(3) 
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and SU(2) have isomorphic Lie algebras. Even worse, some Lie algebras need not have any associated Lie group. 
Nevertheless, when the Lie algebra is finite-dimensional, there is always at least one Lie group whose Lie algebra is 
the one under discussion, and a preferred Lie group can be chosen. Any finite-dimensional connected Lie group has 
a universal cover. This group can be constructed as the image of the Lie algebra under the exponential map. More 
generally, we have that the Lie algebra is homeomorphic to a neighborhood of the identity. But globally, if the Lie 
group is compact, the exponential will not be injective, and if the Lie group is not connected, simply connected or 
compact, the exponential map need not be surjective. 

If the Lie algebra is infinite-dimensional, the issue is more subtle. In many instances, the exponential map is not even 
locally a homeomorphism (for example, in Dif^S 1 ), one may find diffeomorphisms arbitrarily close to the identity 
which are not in the image of exp). Furthermore, some infinite-dimensional Lie algebras are not the Lie algebra of 
any group. 

The correspondence between Lie algebras and Lie groups is used in several ways, including in the classification of 
Lie groups and the related matter of the representation theory of Lie groups. Every representation of a Lie algebra 
lifts uniquely to a representation of the corresponding connected, simply connected Lie group, and conversely every 
representation of any Lie group induces a representation of the group's Lie algebra; the representations are in one to 
one correspondence. Therefore, knowing the representations of a Lie algebra settles the question of representations 
of the group. As for classification, it can be shown that any connected Lie group with a given Lie algebra is 
isomorphic to the universal cover mod a discrete central subgroup. So classifying Lie groups becomes simply a 
matter of counting the discrete subgroups of the center, once the classification of Lie algebras is known (solved by 
Cartan et al. in the semisimple case). 



Category theoretic definition 

Using the language of category theory, a Lie algebra can be defined as an object A in Vec, the category of vector 
spaces together with a morphism [.,.]: A ® A — > A, where ® refers to the monoidal product of Vec, such that 

• [.,-]<> (id + T A , A )=0 

• [■,■] o ([-,•] ® id) o (id + <7 + <7 2 ) = 0 

where x(a® b) := b ® a and a is the cyclic permutation braiding (id ® x ) ° (x <E> id). In diagrammatic form: 



A A A A 




A A 
A AAA AAA AA 




A A A 
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Notes 

[1] Due to the anticommutativity of the commutator, the notions of a left and right ideal in a Lie algebra coincide. 
[2] Humphreys p. 2 
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Lie group 

In mathematics, a Lie group (pronounced /' li I/: similar to "Lee") is a group which is also a differentiable manifold, 
with the property that the group operations are compatible with the smooth structure. Lie groups are named after 
Sophus Lie, who laid the foundations of the theory of continuous transformation groups. 

Lie groups represent the best-developed theory of continuous symmetry of mathematical objects and structures, 
which makes them indispensable tools for many parts of contemporary mathematics, as well as for modern 
theoretical physics. They provide a natural framework for analysing the continuous symmetries of differential 
equations (Differential Galois theory), in much the same way as permutation groups are used in Galois theory for 
analysing the discrete symmetries of algebraic equations. An extension of Galois theory to the case of continuous 
symmetry groups was one of Lie's principal motivations. 
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Overview 
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The circle of center 0 and radius 1 in the complex 
plane is a Lie group with complex multiplication. 



Lie groups are smooth manifolds and, therefore, can be studied using 
differential calculus, in contrast with the case of more general 
topological groups. One of the key ideas in the theory of Lie groups, 
from Sophus Lie, is to replace the global object, the group, with its 
local or linearized version, which Lie himself called its "infinitesimal 
group" and which has since become known as its Lie algebra. 

Lie groups play an enormous role in modern geometry, on several 
different levels. Felix Klein argued in his Erlangen program that one 
can consider various "geometries" by specifying an appropriate 
transformation group that leaves certain geometric properties invariant. 
Thus Euclidean geometry corresponds to the choice of the group E(3) 
of distance-preserving transformations of the Euclidean space R , 
conformal geometry corresponds to enlarging the group to the 
conformal group, whereas in projective geometry one is interested in 

the properties invariant under the projective group. This idea later led to the notion of a G-structure, where G is a Lie 
group of "local" symmetries of a manifold. On a "global" level, whenever a Lie group acts on a geometric object, 
such as a Riemannian or a symplectic manifold, this action provides a measure of rigidity and yields a rich algebraic 
structure. The presence of continuous symmetries expressed via a Lie group action on a manifold places strong 
constraints on its geometry and facilitates analysis on the manifold. Linear actions of Lie groups are especially 
important, and are studied in representation theory. 

In the 1940s-1950s, Ellis Kolchin, Armand Borel and Claude Chevalley realised that many foundational results 
concerning Lie groups can be developed completely algebraically, giving rise to the theory of algebraic groups 
defined over an arbitrary field. This insight opened new possibilities in pure algebra, by providing a uniform 
construction for most finite simple groups, as well as in algebraic geometry. The theory of automorphic forms, an 
important branch of modern number theory, deals extensively with analogues of Lie groups over adele rings; p-adic 
Lie groups play an important role, via their connections with Galois representations in number theory. 



Definitions and examples 

A real Lie group is a group which is also a finite-dimensional real smooth manifold, and in which the group 
operations of multiplication and inversion are smooth maps. Smoothness of the group multiplication 

(i : G X G —> G fi(xj y) = xy 
means that \i is a smooth mapping of the product manifold GxG into G. These two requirements can be combined to 
the single requirement that the mapping 

be a smooth mapping of the product manifold into G. 
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First examples 

• The 2x2 real invertible matrices form a group under multiplication, denoted by GL 2 (R): 

GL 2 (R) = f^A = ^ ^ : det A = ad - be ^ 0 

This is a four-dimensional noncompact real Lie group. This group is disconnected; it has two connected 
components corresponding to the positive and negative values of the determinant. 

• The rotation matrices form a subgroup of GL 2 (R), denoted by S0 2 (R). It is a Lie group in its own right: 
specifically, a one-dimensional compact connected Lie group which is diffeomorphic to the circle. Using the 
rotation angle ^ as a parameter, this group can be parametrized as follows: 

S0 2 (R) = {( COS(P - sin ^W^/27rz). 
v ' y ysm (f cos if J 1 J 

Addition of the angles corresponds to multiplication of the elements of S0 2 (R), and taking the opposite angle 
corresponds to inversion. Thus both multiplication and inversion are differentiable maps. 

• The orthogonal group also forms an interesting example of a Lie group. 

All of the previous examples of Lie groups fall within the class of classical groups 

Related concepts 

A complex Lie group is defined in the same way using complex manifolds rather than real ones (example: SL 2 (C)), 
and similarly one can define a p-adic Lie group over the /?-adic numbers. Hilbert's fifth problem asked whether 
replacing differentiable manifolds with topological or analytic ones can yield new examples. The answer to this 
question turned out to be negative: in 1952, Gleason, Montgomery and Zippin showed that if G is a topological 
manifold with continuous group operations, then there exists exactly one analytic structure on G which turns it into a 
Lie group (see also Hilbert-Smith conjecture). If the underlying manifold is allowed to be infinite dimensional (for 
example, a Hilbert manifold) then one arrives at the notion of an infinite-dimensional Lie group. It is possible to 
define analogues of many Lie groups over finite fields, and these give most of the examples of finite simple groups. 

The language of category theory provides a concise definition for Lie groups: a Lie group is a group object in the 
category of smooth manifolds. This is important, because it allows generalization of the notion of a Lie group to Lie 
supergroups. 



More examples of Lie groups 

Lie groups occur in abundance throughout mathematics and physics. Matrix groups or algebraic groups are (roughly) 
groups of matrices (for example, orthogonal and symplectic groups), and these give most of the more common 
examples of Lie groups. 

Examples 

• Euclidean space with ordinary vector addition as the group operation becomes an ^-dimensional noncompact 
abelian Lie group. 

• The circle group S 1 consisting of angles mod lit under addition or, alternately, the complex numbers with 

absolute value 1 under multiplication. This is a one-dimensional compact connected abelian Lie group. 

2 

• The group GL^(R) of invertible matrices (under matrix multiplication) is a Lie group of dimension n , called the 
general linear group. It has a closed connected subgroup SL^(R), the special linear group, consisting of matrices 
of determinant 1 which is also a Lie group. 

• The orthogonal group O^(R), consisting of d\\nxn orthogonal matrices with real entries is an n(n - 
l)/2-dimensional Lie group. This group is disconnected, but it has a connected subgroup SO (R) of the same 
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dimension consisting of orthogonal matrices of determinant 1, called the special orthogonal group (for n = 3, the 
rotation group). 

• The Euclidean group E^(R) is the Lie group of all Euclidean motions, i.e., isometric affine maps, of 
^-dimensional Euclidean space R". 

• The unitary group U(n) consisting of n x n unitary matrices (with complex entries) is a compact connected Lie 

2 2 
group of dimension n . Unitary matrices of determinant 1 form a closed connected subgroup of dimension n - 1 

denoted S\J(n), the special unitary group. 

• Spin groups are double covers of the special orthogonal groups, used for studying fermions in quantum field 
theory (among other things). 

• The symplectic group Sp (R) consists of all 2n x 2n matrices preserving a nondegenerate skew- symmetric 

In 2 

bilinear form on R (the symplectic form). It is a connected Lie group of dimension 2n + n. The fundamental 
group of the symplectic group is Z and this fact is related to the theory of Maslov index. 

• The 3 -sphere S forms a Lie group by identification with the set of quaternions of unit norm, called versors. The 
only other spheres that admit the structure of a Lie group are the 0-sphere S° (real numbers with absolute value 1) 
and the circle S 1 (complex numbers with absolute value 1). For example, for even n > 1, S n is not a Lie group 

because it does not admit a nonvanishing vector field and so a fortiori cannot be parallelizable as a differentiable 

0 13 7 

manifold. Of the spheres only S , S , S , and S are parallelizable. The latter carries the structure of a Lie 
quasigroup (a nonassociative group), which can be identified with the set of unit octonions. 

• The group of upper triangular n by n matrices is a solvable Lie group of dimension nin + l)/2. 

• The Lorentz group and the Poincare group are the groups of linear and affine isometries of the Minkowski space 
(interpreted as the spacetime of the special relativity). They are Lie groups of dimensions 6 and 10. 

• The Heisenberg group is a connected nilpotent Lie group of dimension 3, playing a key role in quantum 
mechanics. 

• The group U(l)xSU(2)xSU(3) is a Lie group of dimension 1+3+8=12 that is the gauge group of the Standard 
Model in particle physics. The dimensions of the factors correspond to the 1 photon + 3 vector bosons + 8 gluons 
of the standard model. 

• The (3 -dimensional) metaplectic group is a double cover of SL 2 (R) playing an important role in the theory of 
modular forms. It is a connected Lie group that cannot be faithfully represented by matrices of finite size, i.e., a 
nonlinear group. 

• The exceptional Lie groups of types F 4? E , E , E^ have dimensions 14, 52, 78, 133, and 248. There is also a 
group E 71/2 of dimension 190. 

Constructions 

There are several standard ways to form new Lie groups from old ones: 

• The product of two Lie groups is a Lie group. 

• Any topologically closed subgroup of a Lie group is a Lie group. This is known as Cartan's theorem. 

• The quotient of a Lie group by a closed normal subgroup is a Lie group. 

• The universal cover of a connected Lie group is a Lie group. For example, the group R is the universal cover of 
the circle group S 1 . In fact any covering of a differentiable manifold is also a differentiable manifold, but by 
specifying universal cover, one guarantees a group structure (compatible with its other structures). 
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Related notions 

Some examples of groups that are not Lie groups (except in the trivial sense that any group can be viewed as a 
0-dimensional Lie group, with the discrete topology), are: 

• Infinite dimensional groups, such as the additive group of an infinite dimensional real vector space. These are not 
Lie groups as they are not finite dimensional manifolds 

• Some totally disconnected groups, such as the Galois group of an infinite extension of fields, or the additive group 
of the /?-adic numbers. These are not Lie groups because their underlying spaces are not real manifolds. (Some of 
these groups are "/?-adic Lie groups"). In general, only topological groups having similar local properties to R n for 
some positive integer n can be Lie groups (of course they must also have a differentiable structure) 

Early history 

According to the most authoritative source on the early history of Lie groups (Hawkins, p. 1), Sophus Lie himself 
considered the winter of 1873-1874 as the birth date of his theory of continuous groups. Hawkins, however, 
suggests that it was "Lie's prodigious research activity during the four-year period from the fall of 1869 to the fall of 
1873" that led to the theory's creation (ibid). Some of Lie's early ideas were developed in close collaboration with 
Felix Klein. Lie met with Klein every day from October 1869 through 1872: in Berlin from the end of October 1869 
to the end of February 1870, and in Paris, Gottingen and Erlangen in the subsequent two years (ibid, p. 2). Lie stated 
that all of the principal results were obtained by 1884. But during the 1870s all his papers (except the very first note) 
were published in Norwegian journals, which impeded recognition of the work throughout the rest of Europe (ibid, 
p. 76). In 1884 a young German mathematician, Friedrich Engel, came to work with Lie on a systematic treatise to 
expose his theory of continuous groups. From this effort resulted the three- volume Theorie der 
Transformations gruppen, published in 1888, 1890, and 1893. 

Lie's ideas did not stand in isolation from the rest of mathematics. In fact, his interest in the geometry of differential 
equations was first motivated by the work of Carl Gustav Jacobi, on the theory of partial differential equations of 
first order and on the equations of classical mechanics. Much of Jacobi's work was published posthumously in the 
1860s, generating enormous interest in France and Germany (Hawkins, p. 43). Lie's idee fixe was to develop a theory 
of symmetries of differential equations that would accomplish for them what Evariste Galois had done for algebraic 
equations: namely, to classify them in terms of group theory. Lie and other mathematicians showed that the most 
important equations for special functions and orthogonal polynomials tend to arise from group theoretical 
symmetries. Additional impetus to consider continuous groups came from ideas of Bernhard Riemann, on the 
foundations of geometry, and their further development in the hands of Klein. Thus three major themes in 19th 
century mathematics were combined by Lie in creating his new theory: the idea of symmetry, as exemplified by 
Galois through the algebraic notion of a group; geometric theory and the explicit solutions of differential equations 
of mechanics, worked out by Poisson and Jacobi; and the new understanding of geometry that emerged in the works 
of Pliicker, Mobius, Grassmann and others, and culminated in Riemann's revolutionary vision of the subject. 

Although today Sophus Lie is rightfully recognized as the creator of the theory of continuous groups, a major stride 
in the development of their structure theory, which was to have a profound influence on subsequent development of 
mathematics, was made by Wilhelm Killing, who in 1888 published the first paper in a series entitled Die 
Zusammensetzung der stetigen endlichen Transformations gruppen (The composition of continuous finite 
transformation groups) (Hawkins, p. 100). The work of Killing, later refined and generalized by Elie Cartan, led to 
classification of semisimple Lie algebras, Cartan's theory of symmetric spaces, and Hermann Weyl's description of 
representations of compact and semisimple Lie groups using highest weights. 

Weyl brought the early period of the development of the theory of Lie groups to fruition, for not only did he classify 
irreducible representations of semisimple Lie groups and connect the theory of groups with quantum mechanics, but 
he also put Lie's theory itself on firmer footing by clearly enunciating the distinction between Lie's infinitesimal 
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groups (i.e., Lie algebras) and the Lie groups proper, and began investigations of topology of Lie groups (Borel 
(2001), ). The theory of Lie groups was systematically reworked in modern mathematical language in a monograph 
by Claude Che valley. 



The concept of a Lie group, and possibilities of classification 

Lie groups may be thought of as smoothly varying families of symmetries. Examples of symmetries include rotation 
about an axis. What must be understood is the nature of 'small' transformations, e.g., rotations through tiny angles, 
that link nearby transformations. The mathematical object capturing this structure is called a Lie algebra (Lie himself 
called them "infinitesimal groups"). It can be defined because Lie groups are manifolds, so have tangent spaces at 
each point. 

The Lie algebra of any compact Lie group (very roughly: one for which the symmetries form a bounded set) can be 
decomposed as a direct sum of an abelian Lie algebra and some number of simple ones. The structure of an abelian 
Lie algebra is mathematically uninteresting (since the Lie bracket is identically zero); the interest is in the simple 
summands. Hence the question arises: what are the simple Lie algebras of compact groups? It turns out that they 
mostly fall into four infinite families, the "classical Lie algebras" A , B , C and D , which have simple descriptions 

J ° n n n n 

in terms of symmetries of Euclidean space. But there are also just five "exceptional Lie algebras" that do not fall into 
any of these families. E_ is the largest of these. 

Properties 

• The diffeomorphism group of a Lie group acts transitively on the Lie group 

• Every Lie group is parallelizable, and hence an orientable manifold (there is a bundle isomorphism between its 
tangent bundle and the product of itself with the tangent space at the identity) 



Types of Lie groups and structure theory 

Lie groups are classified according to their algebraic properties (simple, semisimple, solvable, nilpotent, abelian), 
their connectedness (connected or simply connected) and their compactness. 

• Compact Lie groups are all known: they are finite central quotients of a product of copies of the circle group S l 
and simple compact Lie groups (which correspond to connected Dynkin diagrams). 

• Any simply connected solvable Lie group is isomorphic to a closed subgroup of the group of invertible upper 
triangular matrices of some rank, and any finite dimensional irreducible representation of such a group is 1 
dimensional. Solvable groups are too messy to classify except in a few small dimensions. 

• Any simply connected nilpotent Lie group is isomorphic to a closed subgroup of the group of invertible upper 
triangular matrices with l's on the diagonal of some rank, and any finite dimensional irreducible representation of 
such a group is 1 dimensional. Like solvable groups, nilpotent groups are too messy to classify except in a few 
small dimensions. 

• Simple Lie groups are sometimes defined to be those that are simple as abstract groups, and sometimes defined to 
be connected Lie groups with a simple Lie algebra. For example, SL 2 (R) is simple according to the second 
definition but not according to the first. They have all been classified (for either definition). 

• Semisimple Lie groups are Lie groups whose Lie algebra is a product of simple Lie algebras They are central 
extensions of products of simple Lie groups. 

The identity component of any Lie group is an open normal subgroup, and the quotient group is a discrete group. 
The universal cover of any connected Lie group is a simply connected Lie group, and conversely any connected Lie 
group is a quotient of a simply connected Lie group by a discrete normal subgroup of the center. Any Lie group G 
can be decomposed into discrete, simple, and abelian groups in a canonical way as follows. Write 

G for the connected component of the identity 

con J 
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G sol for the largest connected normal solvable subgroup 
G nil for the largest connected normal nilpotent subgroup 
so that we have a sequence of normal subgroups 



1 C G C G , C G C G. 

ml sol con 



Then 

GIG is discrete 

con 

G con /G sol is a central extension of a product of simple connected Lie groups. 

C^so/G^ is abelian. A connected abelian Lie group is isomorphic to a product of copies of R and the circle 
group S . 

G nil /1 is nilpotent, and therefore its ascending central series has all quotients abelian. 

This can be used to reduce some problems about Lie groups (such as finding their unitary representations) to the 
same problems for connected simple groups and nilpotent and solvable subgroups of smaller dimension. 

The Lie algebra associated with a Lie group 

To every Lie group, we can associate a Lie algebra, whose underlying vector space is the tangent space of G at the 
identity element, which completely captures the local structure of the group. Informally we can think of elements of 
the Lie algebra as elements of the group that are "infinitesimally close" to the identity, and the Lie bracket is 
something to do with the commutator of two such infinitesimal elements. Before giving the abstract definition we 
give a few examples: 

• The Lie algebra of the vector space is just R" with the Lie bracket given by 

[A, B] = 0. 

(In general the Lie bracket of a connected Lie group is always 0 if and only if the Lie group is abelian.) 

• The Lie algebra of the general linear group GL^(R) of invertible matrices is the vector space MJJl) of square 
matrices with the Lie bracket given by 

[A, B] - AB - BA. 

If G is a closed subgroup of GL (R) then the Lie algebra of G can be thought of informally as the matrices m of 

n 2 
M (R) such that 1 + em is in G, where 8 is an infinitesimal positive number with 8=0 (of course, no such real 

n T 

number 8 exists). For example, the orthogonal group O (R) consists of matrices A with AA = 1, so the Lie algebra 

T n T 2 

consists of the matrices m with (1 + 8m)(l + 8m) = 1, which is equivalent to m + m =0 because 8=0. 

• Formally, when working over the reals, as here, this is accomplished by considering the limit as 8 — > 0; but the 
"infinitesimal" language generalizes directly to Lie groups over general rings. 

The concrete definition given above is easy to work with, but has some minor problems: to use it we first need to 
represent a Lie group as a group of matrices, but not all Lie groups can be represented in this way, and it is not 
obvious that the Lie algebra is independent of the representation we use. To get round these problems we give the 
general definition of the Lie algebra of any Lie group (in 4 steps): 

1 . Vector fields on any smooth manifold M can be thought of as derivations X of the ring of smooth functions on the 
manifold, and therefore form a Lie algebra under the Lie bracket [X, Y] = XY - YX, because the Lie bracket of any 
two derivations is a derivation. 

2. If G is any group acting smoothly on the manifold M, then it acts on the vector fields, and the vector space of 
vector fields fixed by the group is closed under the Lie bracket and therefore also forms a Lie algebra. 

3. We apply this construction to the case when the manifold M is the underlying space of a Lie group G, with G 
acting on G = M by left translations L (h) = gh. This shows that the space of left invariant vector fields (vector 
fields satisfying L X = X for every h in G, where L # denotes the differential of L ) on a Lie group is a Lie 

8 ft 8ft 8 8 
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algebra under the Lie bracket of vector fields. 
4. Any tangent vector at the identity of a Lie group can be extended to a left invariant vector field by left translating 
the tangent vector to other points of the manifold. Specifically, the left invariant extension of an element v of the 
tangent space at the identity is the vector field defined by v A =L t y. This identifies the tangent space T at the 

8 8 e 

identity with the space of left invariant vector fields, and therefore makes the tangent space at the identity into a 
Lie algebra, called the Lie algebra of G, usually denoted by a Fraktur 0. Thus the Lie bracket on 0is given 
explicitly by [v, w] = [v A , w A ]^. 

This Lie algebra 0is finite-dimensional and it has the same dimension as the manifold G. The Lie algebra of G 
determines G up to "local isomorphism", where two Lie groups are called locally isomorphic if they look the same 
near the identity element. Problems about Lie groups are often solved by first solving the corresponding problem for 
the Lie algebras, and the result for groups then usually follows easily. For example, simple Lie groups are usually 
classified by first classifying the corresponding Lie algebras. 

We could also define a Lie algebra structure on 7^ using right invariant vector fields instead of left invariant vector 
fields. This leads to the same Lie algebra, because the inverse map on G can be used to identify left invariant vector 
fields with right invariant vector fields, and acts as -1 on the tangent space 7\ 

The Lie algebra structure on T can also be described as follows: the commutator operation 

(x, y) — > xyx y 

on G x G sends (e, e) to e, so its derivative yields a bilinear operation on TG. This bilinear operation is actually the 
zero map, but the second derivative, under the proper identification of tangent spaces, yields an operation that 
satisfies the axioms of a Lie bracket, and it is equal to twice the one defined through left-invariant vector fields. 

Homomorphisms and isomorphisms 

If G and H are Lie groups, then a Lie-group homomorphism / : G — » H is a smooth group homomorphism. (It is 
equivalent to require only that /be continuous rather than smooth.) The composition of two such homomorphisms is 
again a homomorphism, and the class of all Lie groups, together with these morphisms, forms a category. Two Lie 
groups are called isomorphic if there exists a bijective homomorphism between them whose inverse is also a 
homomorphism. Isomorphic Lie groups are essentially the same; they only differ in the notation for their elements. 

Every homomorphism / : G — » H of Lie groups induces a homomorphism between the corresponding Lie algebras 0 
and f) . The association G > 0is a functor (mapping between categories satisfying certain axioms). 

One version of Ado's theorem is that every finite dimensional Lie algebra is isomorphic to a matrix Lie algebra. For 
every finite dimensional matrix Lie algebra, there is a linear group (matrix Lie group) with this algebra as its Lie 
algebra. So every abstract Lie algebra is the Lie algebra of some (linear) Lie group. 

The global structure of a Lie group is not determined by its Lie algebra; for example, if Z is any discrete subgroup of 
the center of G then G and G/Z have the same Lie algebra (see the table of Lie groups for examples). A connected 
Lie group is simple, semisimple, solvable, nilpotent, or abelian if and only if its Lie algebra has the corresponding 
property. 

If we require that the Lie group be simply connected, then the global structure is determined by its Lie algebra: for 
every finite dimensional Lie algebra 0over F there is a simply connected Lie group G with 0as Lie algebra, unique 
up to isomorphism. Moreover every homomorphism between Lie algebras lifts to a unique homomorphism between 
the corresponding simply connected Lie groups. 
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The exponential map 

The exponential map from the Lie algebra M^(R) of the general linear group GL^(R) to GL^(R) is defined by the 
usual power series: 

A 2 A 3 
exp(A) = l + A + — + — + ••• 

for matrices A. If G is any subgroup of GL^(R), then the exponential map takes the Lie algebra of G into G, so we 
have an exponential map for all matrix groups. 

The definition above is easy to use, but it is not defined for Lie groups that are not matrix groups, and it is not clear 
that the exponential map of a Lie group does not depend on its representation as a matrix group. We can solve both 
problems using a more abstract definition of the exponential map that works for all Lie groups, as follows. 

Every vector v in g determines a linear map from R to 0 taking 1 to v, which can be thought of as a Lie algebra 
homomorphism. Because R is the Lie algebra of the simply connected Lie group R, this induces a Lie group 
homomorphism c : R — » G so that 

c(s + t) = c(s)c(t) 

for all s and t. The operation on the right hand side is the group multiplication in G. The formal similarity of this 
formula with the one valid for the exponential function justifies the definition 

exp(f) = c(l). 

This is called the exponential map, and it maps the Lie algebra 0into the Lie group G. It provides a diffeomorphism 
between a neighborhood of 0 in 0and a neighborhood of e in G. This exponential map is a generalization of the 
exponential function for real numbers (because R is the Lie algebra of the Lie group of positive real numbers with 
multiplication), for complex numbers (because C is the Lie algebra of the Lie group of non-zero complex numbers 
with multiplication) and for matrices (because M^(R) with the regular commutator is the Lie algebra of the Lie group 
GL^(R) of all invertible matrices). 

Because the exponential map is surjective on some neighbourhood N of e, it is common to call elements of the Lie 
algebra infinitesimal generators of the group G. The subgroup of G generated by N is the identity component of G. 

The exponential map and the Lie algebra determine the local group structure of every connected Lie group, because 
of the Baker-Campbell-Hausdorff formula: there exists a neighborhood U of the zero element of 0, such that for u, 
v in U we have 

exp(w) exp(v) = exp(w + v + 1/2 [u, v] + 1/12 [[u, v], v] - 1/12 [[u, v], u] - ...) 

where the omitted terms are known and involve Lie brackets of four or more elements. In case u and v commute, this 
formula reduces to the familiar exponential law exp(w) exp(v) = exp(w + v). 

The exponential map from the Lie algebra to the Lie group is not always onto, even if the group is connected (though 
it does map onto the Lie group for connected groups that are either compact or nilpotent). For example, the 
exponential map of SL 2 (R) is not surjective. 

Infinite dimensional Lie groups 

Lie groups are finite dimensional by definition, but there are many groups that resemble Lie groups, except for being 
infinite dimensional. There is very little "general theory" of such groups, but some of the examples that have been 
studied include: 

• The group of diffeomorphisms of a manifold. Quite a lot is known about the group of diffeomorphisms of the 
circle. Its Lie algebra is (more or less) the Witt algebra, which has a central extension called the Virasoro algebra, 
used in string theory and conformal field theory. Very little is known about the diffeomorphism groups of 
manifolds of larger dimension. The diffeomorphism group of spacetime sometimes appears in attempts to 
quantize gravity. 
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• The group of smooth maps from a manifold to a finite dimensional Lie group is called a gauge group (with 
operation of pointwise multiplication), and is used in quantum field theory and Donaldson theory. If the manifold 
is a circle these are called loop groups, and have central extensions whose Lie algebras are (more or less) 
Kac-Moody algebras. 

• There are infinite dimensional analogues of general linear groups, orthogonal groups, and so on. One important 
aspect is that these may have simpler topological properties: see for example Kuiper's theorem. 

• Just as calculus in finite-dimensional real vector spaces can be extended to calculus in Banach spaces, the 
definition of finite-dimensional smooth manifolds can be extended to give a definition of Banach analytic 
manifolds. Similarly, the standard finite-dimensional definition of Lie groups can be extended to give a definition 
of Banach analytic Lie groups. In this case, we have a Banach analytic manifold which simultaneously has a 
group structure such that multiplication and inversion are analytic maps. Some of the theorems of 
finite-dimensional Lie groups do not carry over to the Banach analytic case, and in particular the relation between 
Lie groups and Lie algebras is much more subtle in the infinite dimensional case. However, it is true that "for 
infinite dimensional Lie groups modeled on Banach spaces there is a well-developed theory ... which is closely 
parallel to the theory of finite dimensional Lie groups.' 

Notes 

[1] Sigurdur Helgason, "Differential Geometry, Lie Groups, and Symmetric Spaces", Academic Press, 1978, page 131. 
[2] Andrew Pressley and Graeme Segal, Loop Groups, Oxford Science Publications, 1986, page 26. 
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Hopf algebra 

In mathematics, a Hopf algebra, named after Heinz Hopf, is a structure that is simultaneously a (unital associative) 
algebra, a (counital coassociative) coalgebra, with these structures compatible making it a bialgebra, and moreover is 
equipped with an antiautomorphism satisfying a certain property. 

Hopf algebras occur naturally in algebraic topology, where they originated and are related to the H-space concept, in 
group scheme theory, in group theory (via the concept of a group ring), and in numerous other places, making them 
probably the most familiar type of bialgebra. Hopf algebras are also studied in their own right, with much work on 
specific classes of examples on the one hand and classification problems on the other. 



Formal definition 

Formally, a Hopf algebra is a bialgebra H over a field K together with a TT-linear map S\ H 
antipode) such that the following diagram commutes: 

S(g>id 



H (called the 




id<g>S 



Here A is the comultiplication of the bialgebra, V its multiplication, n its unit and 8 its counit. In the sumless 
Sweedler notation, this property can also be expressed as 

S r (c( 1 ))c( 2 ) = C(!)5(c( 2 )) = e(c)l for all c G H. 

As for algebras, one can replace the underlying field K with a commutative ring R in the above definition. 

The definition of Hopf algebra is self-dual (as reflected in the symmetry of the above diagram), so if one can define a 
dual of H (which is always possible if H is finite-dimensional), then it is automatically a Hopf algebra. 



Properties of the antipode 

The antipode S is sometimes required to have a 7^-linear inverse, which is automatic in the finite-dimensional case, or 
if H is commutative or cocommutative (or more generally quasitriangular). 

In general, S is an antihomomorphism,^ so S^is a homomorphism, which is therefore an automorphism if S was 
invertible (as may be required). 

If g 2 — Jd , then the Hopf algebra is said to be involutive (and the underlying algebra with involution is a 
*-algebra). If H is finite-dimensional semisimple over a field of characteristic zero, commutative, or cocommutative, 
then it is involutive. 

If a bialgebra B admits an antipode S, then S is unique ("a bialgebra admits at most 1 Hopf algebra structure"). 
The antipode is an analog to the inversion map on a group that sends 9 to g~ 1 ^ 
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Hopf subalgebras 

A subalgebra K (not to be confused with the Field K in the notation above) of a Hopf algebra H is a Hopf subalgebra 
if it is a subcoalgebra of H and the antipode S maps K into K. In other words, a Hopf subalgebra K is a Hopf algebra 
in its own right when the multiplication, comultiplication, counit and antipode of H is restricted to K (and 
additionally the identity 1 is required to be in K). The Nichols-Zoeller Freeness theorem established (in 1989) that 
either natural K-module H is free of finite rank if H is finite dimensional: a generalization of Lagrange's theorem for 
subgroups. As a corollary of this and integral theory, a Hopf subalgebra of a semisimple finite dimensional Hopf 
algebra is automatically semisimple. 

A Hopf subalgebra K is said to be right normal in a Hopf algebra H if it satisfies the condition of stability, 
ad r (h)(K) C K for all h in H, where the right adjoint mapping ad T is defined by 
ad T (h)(k) = iS , (/i( 1 ))fc/i(2)f° r a U k in K, h in H. Similarly, a Hopf subalgebra K is left normal in H if it is stable 
under the left adjoint mapping defined by ad£(h)(k) = h^kS(h^2)) • The two conditions of normality are 

equivalent if the antipode S is bijective, in which case K is said to be a normal Hopf subalgebra. 

A normal Hopf subalgebra K in H satisfies the condition (of equality of subsets of H): = K + H wnere 

denotes the kernel of the counit on K. This normality condition implies that HK^~ 1 ^ a Hopf ideal of H (i.e. an 

algebra ideal in the kernel of the counit, a coalgebra coideal and stable under the antipode). As a consequence one 

has a quotient Hopf algebra H/HK^~znd epimorphism H —> H/K^~H , a theory analogous to that of normal 

Mi 

subgroups and quotient groups in group theory. 

Examples 

Group algebra. Suppose G is a group. The group algebra KG is a unital associative algebra over K. It turns into a 
Hopf algebra if we define 

• A : KG —> KG ® KG by A(g) = g ® g for all g in G 

• 8 : KG Kby e(g) = 1 for all g in G 

• S : KG — » KG by S(g) = g~ l for all g in G. 

Functions on a finite group. Suppose now that G is a finite group. Then the set of all functions from G to K with 
pointwise addition and multiplication is a unital associative algebra over K, and is naturally isomorphic to 

(for G infinite, is a proper subset of K° xG ). The set K° becomes a Hopf algebra if we define 

• A : K° -> K° xG by A(f)(x,y) =f(xy) for all /in K° and all x,y in G 

—> Kby e(f) =f(e) for every /in [here e is the identity element of G] 

• S : K° -> K° by S(f)(x) =f(x~ l ) for all/in K° and all x in G. 

Note that functions on a finite group can be identified with the group ring, though these are more naturally thought of 
as dual - the group ring consists of finite sums of elements, and thus pairs with functions on the group by evaluating 
the function on the summed elements. 

Regular functions on an algebraic group. Generalizing the previous example, we can use the same formulas to 
show that for a given algebraic group G over K, the set of all regular functions on G forms a Hopf algebra. 

Universal enveloping algebra. Suppose g is a Lie algebra over the field K and U is its universal enveloping algebra. 
U becomes a Hopf algebra if we define 

• A : U — » U® Uby A(x) = x®l + l®xfor every x in g (this rule is compatible with commutators and can 
therefore be uniquely extended to all of U). 

• 8 : U — » K by z(x) = 0 for all x in g (again, extended to U) 

• S : U —> U by S(x) = -x for all x in g. 
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Cohomology of Lie groups 

The cohomology algebra of a Lie group is a Hopf algebra: the multiplication is provided by the cup-product, and the 
comultiplication 

H*{G) -> H*(G xG) = H*(G) g> H*(G) 

by the group multiplication G X G — > G • This observation was actually a source of the notion of Hopf algebra. 
Using this structure, Hopf proved a structure theorem for the cohomology algebra of Lie groups. 
Theorem (Hopf) 1 ^ Let A be a finite-dimensional, graded commutative, graded cocommutative Hopf algebra over a 
field of characteristic 0. Then A (as an algebra) is a free exterior algebra with generators of odd degree. 

Quantum groups and non-commutative geometry 

All examples above are either commutative (i.e. the multiplication is commutative) or co-commutative (i.e. A = J 1 o 
A where T: H ® H — » H ® H is defined by T(x ® y) = y ® x). Other interesting Hopf algebras are certain 
"deformations" or "quantizations" of those from example 3 which are neither commutative nor co-commutative. 
These Hopf algebras are often called quantum groups, a term that is so far only loosely defined. They are important 
in noncommutative geometry, the idea being the following: a standard algebraic group is well described by its 
standard Hopf algebra of regular functions; we can then think of the deformed version of this Hopf algebra as 
describing a certain "non-standard" or "quantized" algebraic group (which is not an algebraic group at all). While 
there does not seem to be a direct way to define or manipulate these non-standard objects, one can still work with 
their Hopf algebras, and indeed one identifies them with their Hopf algebras. Hence the name "quantum group". 

Related concepts 

Graded Hopf algebras are often used in algebraic topology: they are the natural algebraic structure on the direct sum 
of all homology or cohomology groups of an H-space. 

Locally compact quantum groups generalize Hopf algebras and carry a topology. The algebra of all continuous 
functions on a Lie group is a locally compact quantum group. 

Quasi-Hopf algebras are generalizations of Hopf algebras, where coassociativity only holds up to a twist. 

Weak Hopf algebras, or quantum groupoids, are generalizations of Hopf algebras. Like Hopf algebras, weak Hopf 
algebras form a self-dual class of algebras; i.e., if H is a (weak) Hopf algebra, so is JJ* , the dual space of linear 
forms on H (with respect to the algebra-coalgebra structure obtained from the natural pairing with H and its 
coalgebra- algebra structure). 

A weak Hopf algebra H is usually taken to be a 1) finite dimensional algebra and coalgebra with coproduct 
A : H — > H ® H and counit e : H ^ k satisfying all the axioms of Hopf algebra except possibly 
A(l) 7^ 1 ® lor e(afe) ^ e(a)e(b)for some a,b in H. Instead one requires that 
(A(l) ® 1)(1 ® A(l)) = (A(l) ® 1)(1 ® A(l)) = (A <8> Id)A(l)and 
e[abc) = y^e(ab(i))e(b( 2 ) c) = e(ab( 2 ))e(b( 1 )c) for all a,b, and c in H. 

2) H has a weakened antipode S : H — » H satisfying the axioms (a) - (c): (a) S{a^)a^) = l(i)e(al(2))f° r 
all a in H (the right-hand side is the interesting projection usually denoted by 11^ (a) or e s (a)with image a 
separable subalgebra denoted by JJ R or H s ); (b) a^S{a^) = f(l(i)a)l(2)for all a in H (another interesting 
projection usually denoted by II jC/ (a)or ^ (a) with image a separable algebra jj L ov i/ t , anti-isomorphic to 
jj L Vm S); and (c) S{a^i^a^2)S{a^ = S{a)for all a in H. Note that if A(l) = 1 ® 1, these conditions 

^Sii^^W^attl^t^^WIlM ftg mitig^f ef «tfggt#te^% rigid tensor category. The unit H-module is the 
separable algebra ff L mentioned above. 
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For example, a finite groupoid algebra is a weak Hopf algebra. In particular, the groupoid algebra on [n] with one 
pair of invertible arrows &ij and ^between i and j in [n] is isomorphic to the algebra H of n x n matrices. The 
weak Hopf algebra structure on this particular H is given by coproduct A(e^j) = e^- ® , counit 6(e^*) = 1 
and antipode S{e.ij) = Cji . The separable subalgebras jj L znd ff R coincide and are non-central commutative 

algebras in this particular case (the subalgebra of diagonal matrices). 

Early theoretical contributions to weak Hopf algebras are to be found in ^ as well as ^ 

Hopf algebroids introduced by J.-H. Lu in 1996 as a result on work on groupoids in Poisson geometry (later shown 
equivalent in nontrivial way to a construction of Takeuchi from the 1970s and another by Xu around the year 2000): 
Hopf algebroids generalize weak Hopf algebras and certain skew Hopf algebras. They may be loosely thought of as 
Hopf algebras over a noncommutative base ring, where weak Hopf algebras become Hopf algebras over a separable 
algebra. It is a theorem that a Hopf algebroid satisfying a finite projectivity condition over a separable algebra is a 
weak Hopf algebra, and conversely a weak Hopf algebra H is a Hopf algebroid over its separable subalgebra ff L . 
The antipode axioms have been changed by G. Bohm and K. Szlachanyi (J. Algebra) in 2004 for tensor categorical 
reasons and to accommodate examples associated to depth two Frobenius algebra extensions. 

A left Hopf algebroid (H,R) is a left bialgebroid together with an antipode: the bialgebroid (H,R) consists of a total 
algebra H and a base algebra R and two mappings, an algebra homomorphism s : R — > H called a source map, an 
algebra anti-homomorphism t : R — > H called a target map, such that the commutativity condition 
s(ri)t(r2) = t(r2)s(ri)is satisfied for all /"i, € R • The axioms resemble those of a Hopf algebra but are 
complicated by the possibility that R is a noncommutative algebra or its images under s and t are not in the center of 
H. In particular a left bialgebroid (H,R) has an R-R-bimodule structure on H which prefers the left side as follows: 
7*1 • h • T2 = s(ri)t{r2)h for all h in H, 7*1 5 7*2 G R • There is a coproduct A : H — > H H and counit 
e ; H —> R mat make (i?, i?, A, e)an R-coring (with axioms like that of a coalgebra such that all mappings are 
R-R-bimodule homomorphisms and all tensors over R). Additionally the bialgebroid (H,R) must satisfy 
A (aft) = A (a) A (6) for all a,b in H, and a condition to make sure this last condition makes sense: every image 
point A(a) satisfies a^t(r) (g) a^ 2 ) = ® a^)s(r) for all r in R. Also A(l) = 1 ® 1. The counit is 

W k ^o^ is S fH^ll ^iuW 0 ^^^)^ ftek(^aFto^^)ktisfying conditions of 
exchanging the source and target maps and satisfying two axioms like Hopf algebra antipode axioms; see the 
references in Lu or in Bohm- Szlachanyi for a more example-category friendly, though somewhat more complicated, 
set of axioms for the antipode S. The latter set of axioms depend on the axioms of a right bialgebroid as well, which 
are a straightforward switching of left to right, s with t, of the axioms for a left bialgebroid given above. 

As an example of left bialgebroid, take R to be any algebra over a field k. Let H be its algebra of linear 

self-mappings. Let s(r) be left multication by r on R; let t(r) be right multiplication by r on R. H is a left bialgebroid 

over R, which may be seen as follows. From the fact that H ®ij H = Honife(i2 ® i? 5 i?)one may define a 

coproduct by A(/)(r® u) — f(ru) for each linear transformation f from R to itself and all r,u in R. 

Coassociativity of the coproduct follows from associativity of the product on R. A counit is given by = f(l) 

. The counit axioms of a coring follow from the identity element condition on multiplication in R. The reader will be 

amused, or at least edified, to check that (H,R) is a left bialgebroid. In case R is an Azumaya algebra, in which case 

H is isomorphic to R tensor R, an antipode comes from transposing tensors, which makes H a Hopf algebroid over R. 
Multiplier Hopf algebras introduced by Alfons Van Daele in 1994^ are generalizations of Hopf algebras where 

comultiplication from an algebra (with or withthout unit) to the multiplier algebra of tensor product algebra of the 

algebra with itself. 

Hopf group- (co)algebras introduced by V.G.Turaev in 2000 are also generalizations of Hopf algebras. 
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Analogy with groups 

Groups can be axiomatized by the same diagrams (equivalently, operations) as a Hopf algebra, where G is taken to 
be a set instead of a module. In this case: 

• the field K is replaced by the 1 -point set 

• there is a natural counit (map to 1 point) 

• there is a natural comultiplication (the diagonal map) 

• the unit is the identity element of the group 

• the multiplication is the multiplication in the group 

• the antipode is the inverse 

rm 

In this philosophy, a group can be thought of as a Hopf algebra over the "field with one element". 

See also 

• Quasitriangular Hopf algebra 

• Algebra/set analogy 

• Representation theory of Hopf algebras 

• Ribbon Hopf algebra 

• Superalgebra 

• Supergroup 

• Anyonic Lie algebra 

Notes 

[1] Dascalescu, Nastasescu & Raianu (2001), Prop. 4.2.6, p. 153 (http://books.google.com/books ?id=pBJ6sbPHA0IC&pg=PA153&dq="is+ 
an+ antimorphism+ of + algebras " ) 

[2] Dascalescu, Nastasescu & Raianu (2001), Remarks 4.2.3, p. 151 (http://books.google.com/books ?id=pBJ6sbPHA0IC&pg=PA151& 

dq="the+antipode+is+unique") 
[3] Quantum groups lecture notes (http://www.mathematik.uni-muenchen.de/~pareigis/Vorlesungen/QuantGrp/ln2_l.pdf) 
[4] S. Montgomery, Hopf algebras and their actions on rings, Conf. Board in Math. Sci. vol. 82, A.M.S., 1993. ISBN 0-8218-0738-2 
[5] Hopf, 1941. 

[6] Gabriella Bohm, Florian Nill, Kornel Szlachanyi. J. Algebra 221 (1999), 385-438 

[7] Dmitri Nikshych, Leonid Vainerman, in: New direction in Hopf algebras, S. Montgomery and H.-J. Schneider, eds., M.S.R.I. Publications, 

vol. 43, Cambridge, 2002, 211-262. 
[8] Alfons Van Daele. Multiplier Hopf algebras (http://www.ams.org/tran/1994-342-02/S0002-9947-1994-1220906-5/ 

S0002-9947-1994-1220906-5.pdf), Transactions of the American Mathematical Society 342(2) (1994) 917-932 
[9] Group = Hopf algebra « Secret Blogging Seminar (http://sbseminar.wordpress.com/2007/10/07/group-hopf-algebra/), Group objects and 

Hopf algebras (http://www.youtube.com/watch?v=p3kkm5dYH-w), video of Simon Willerton. 
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Quantum group 

In mathematics and theoretical physics, the term quantum group denotes various kinds of noncommutative algebra 
with additional structure. In general, a quantum group is some kind of Hopf algebra. There is no single, 
all-encompassing definition, but instead a family of broadly similar objects. 

The term "quantum group" often denotes a kind of noncommutative algebra with additional structure that first 
appeared in the theory of quantum integrable systems, and which was then formalized by Vladimir Drinfel'd and 
Michio Jimbo as a particular class of Hopf algebra. The same term is also used for other Hopf algebras that deform 
or are close to classical Lie groups or Lie algebras, such as a v bicrossproduct' class of quantum groups introduced by 
Shahn Majid a little after the work of Drinfeld and Jimbo. 

In Drinfeld's approach, quantum groups arise as Hopf algebras depending on an auxiliary parameter q or h, which 
become universal enveloping algebras of a certain Lie algebra, frequently semisimple or affine, when q = 1 or h = 0. 
Closely related are certain dual objects, also Hopf algebras and also called quantum groups, deforming the algebra of 
functions on the corresponding semisimple algebraic group or a compact Lie group. 

Just as groups often appear as symmetries, quantum groups act on many other mathematical objects and it has 
become fashionable to introduce the adjective quantum in such cases; for example there are quantum planes and 
quantum Grassmannians. 

Intuitive meaning 

The discovery of quantum groups was quite unexpected, since it was known for a long time that compact groups and 
semisimple Lie algebras are "rigid" objects, in other words, they cannot be "deformed". One of the ideas behind 
quantum groups is that if we consider a structure that is in a sense equivalent but larger, namely a group algebra or a 
universal enveloping algebra, then a group or enveloping algebra can be "deformed", although the deformation will 
no longer remain a group or enveloping algebra. More precisely, deformation can be accomplished within the 
category of Hopf algebras that are not required to be either commutative or cocommutative. One can think of the 
deformed object as an algebra of functions on a "noncommutative space", in the spirit of the noncommutative 
geometry of Alain Connes. This intuition, however, came after particular classes of quantum groups had already 
proved their usefulness in the study of the quantum Yang-Baxter equation and quantum inverse scattering method 
developed by the Leningrad School (Ludwig Faddeev, Leon Takhtajan, Evgenii Sklyanin, Nicolai Reshetikhin and 
Korepin) and related work by the Japanese School. 1 ^ The intuition behind the second, bicrossproduct, class of 
quantum groups was different and came from the search for self-dual objects as an approach to quantum gravity 1 J . 

Drinfel'd- Jimbo type quantum groups 

One type of objects commonly called a "quantum group" appeared in the work of Vladimir Drinfel'd and Michio 
Jimbo as a deformation of the universal enveloping algebra of a semisimple Lie algebra or, more generally, a 
Kac-Moody algebra, in the category of Hopf algebras. The resulting algebra has additional structure, making it into a 
quasitriangular Hopf algebra. 

Let A = (a^)be the Cartan matrix of the Kac-Moody algebra, and let q be a nonzero complex number distinct 

from 1, then the quantum group, U q {G) , where G is the Lie algebra whose Cartan matrix is A, is defined as the 

unital associative algebra with generators k\ (where \ is an element of the weight lattice, i.e. 

2(A, cu) I (o^, OLi) € Z for all /), and and /• (for simple roots, oti ), subject to the following relations: 
• k 0 = 1, 
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ki — jfc" 1 

[ e ii fj] = $ij ~' 
ft - ft 



. V (-l) n - ai3 'k' ! e n e ■ e 1 ~ aij ~ n = 0,fori^j, 



where = fc^, ^ = qhip^w) , [0] g J = 1, [n] g J = [m] gi for all positive integers n, and 

m=l 

q™ - q 7™> 

\rri\ q . = — .These are the q-factorial and q-number, respectively, the q-analogs of the ordinary factorial. 

<li ~ Qi 

The last two relations above are the g-Serre relations, the deformations of the Serre relations. 

In the limit as q —> 1 , these relations approach the relations for the universal enveloping algebra U [G) , where 
k\ — k_\ 

kx —> 1 and > i A as q — > 1 , where the element, t\ , of the Cartan subalgebra satisfies 

q-q- 1 

(t^ : h) = A(/i) for all h in the Cartan subalgebra. 

There are various coassociative coproducts under which these algebras are Hopf algebras, for example, 

• Ai(fc A ) = k x ® fc A , Ai( ei ) = 1 ® e; + e { ® fc i? Ai(/0 = fc," 1 ® £ + £ ® 1, 

• A 2 (fc A ) = k x ®k x , A 2 (ei) = fcr 1 <g> + e< <g> 1 , A 2 (/ i ) = 1 ® £ + £ ® A* , 

• A 3 (fc A ) = k\ k\, A 3 (ei) = k;* ® e, + e, ® fc/> A 3 (£) = fc^ ® + /i ® fc?' where the 

set of generators has been extended, if required, to include fc A for X which is expressible as the sum of an 

element of the weight lattice and half an element of the root lattice. 
In addition, any Hopf algebra leads to another with reversed copproduct T o A » where J 1 is given by 

T[x ® y) = y ® x , giving three more possible versions. 

The counit on U q (A)is the same for all these coproducts: e(/c A ) = 1, e(e^) = 0, e(fi) = 0, and the 
respective antipodes for the above coproducts are given by 

• S'i(fc A ) = fc_ A , 5i(ci) = -ejfcr 1 , Si(/i) = 

• S 2 (fc A ) = fc-A 5 -Safe) = -kie h S 2 (fi) = -fiK 1 , 

• S 3 (k x ) = fc_ A , 5 3 (ei) = -fcei, 5 3 (/i) = -g x rl /i- 

Alternatively, the quantum group [/ g (G)can be regarded as an algebra over the field C(q) , the field of all 
rational functions of an indeterminate q over C • 

Similarly, the quantum group [/ g (G)can be regarded as an algebra over the field Q(g), the field of all rational 
functions of an indeterminate q over (Q) (see below in the section on quantum groups at q = 0). 

Representation theory 

Just as there are many different types of representations for Kac-Moody algebras and their universal enveloping 
algebras, so there are many different types of representation for quantum groups. 

As is the case for all Hopf algebras, U q {G) has an adjoint representation on itself as a module, with the action being 
given by Ad -2/ = J2 X W yS ( X ^> where A ( x ) = E x (l) ® X W. 

(x) (*) 
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Case 1: q is not a root of unity 

One important type of representation is a weight representation, and the corresponding module is called a weight 
module. A weight module is a module with a basis of weight vectors. A weight vector is a nonzero vector v such that 
k\.v = d\V for all \ , where d\ are complex numbers for all weights \ such that 

• do = l, 

• d\d^ — dx+p , for all weights \ and ft . 

A weight module is called integrable if the actions of and f± are locally nilpotent (i.e. for any vector v in the 
module, there exists a positive integer k, possibly dependent on v, such that e±.V = fj*.v = Ofor all /). In the case 
of integrable modules, the complex numbers d\ associated with a weight vector satisfy d\ = C\q^ X ^ » where v 

is an element of the weight lattice, and C\ are complex numbers such that 

• c 0 = 1, 

• c \ c fi — c A+/i , for all weights \ and ft , 

• C2 ai = 1 for all /. 

Of special interest are highest weight representations, and the corresponding highest weight modules. A highest 
weight module is a module generated by a weight vector v, subject to k\.v = d^-ufor all weights \ , and 
e im y = Ofor all i. Similarly, a quantum group can have a lowest weight representation and lowest weight module, 
i.e. a module generated by a weight vector v, subject to k\.v — d\V for all weights \ , and fc.v = Ofor all /. 
Define a vector v to have weight v if k\.v = q^ X ^v for all \ in the weight lattice. 

If G is a Kac-Moody algebra, then in any irreducible highest weight representation of U q {G) , with highest weight 

v , the multiplicities of the weights are equal to their multiplicities in an irreducible representation of JJ (G) with 

equal highest weight. If the highest weight is dominant and integral (a weight ft is dominant and integral if ft 

satisfies the condition that 2(/i, a^) / (a^, a^)is a non-negative integer for all /), then the weight spectrum of the 

irreducible representation is invariant under the Weyl group for G, and the representation is intesrable. 
Conversely, if a highest weight module is integrable, then its highest weight vector v satisfies k\.v = C\q^ ,v ^v » 

where C\ are complex numbers such that 

• Q) = 1, 

• c X c fi = c A+/i , for all weights \ and ft , 

• c 2ai — 1 f° r a U U 

and v is dominant and integral. 

As is the case for all Hopf algebras, the tensor product of two modules is another module. For an element x of 
U q (G), and for vectors v and w in the respective modules, x.(v ® w) = A(x).(i; ® w) » so that 
k\.{v ®w) = k\.v <g> fc A .w , and in the case of coproduct Ai, ei.(v ® w) = ki.v <g> e^.iL? + ei.v ® w 
and /-.(i; <g) w) = V <g) f t .w + /^.^ (g) k^.W • 

The integrable highest weight module described above is a tensor product of a one-dimensional module (on which 
k\ = C\ for all A , and a = fi = 0 for all /) and a highest weight module generated by a nonzero vector , 
subject to k\.VQ = q^^VQ^ov all weights \ , and e^.^o = Ofor all L 

In the specific case where G is a finite-dimensional Lie algebra (as a special case of a Kac-Moody algebra), then the 
irreducible representations with dominant integral highest weights are also finite-dimensional. 

In the case of a tensor product of highest weight modules, its decomposition into submodules is the same as for the 
tensor product of the corresponding modules of the Kac-Moody algebra (the highest weights are the same, as are 
their multiplicities). 
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Quasitriangularity 

Case 1: q is not a root of unity 

Strictly, the quantum group U q {G) is not quasitriangular, but it can be thought of as being "nearly quasitriangular" 

in that there exists an infinite formal sum which plays the role of an /^-matrix. This infinite formal sum is expressible 
in terms of generators 6^ and f{ , and Cartan generators t\ , where k\ is formally identified with g** . The 
infinite formal sum is the product of two factors, g 7 ? t^j®^ , and an infinite formal sum, where {A^} is a basis 

for the dual space to the Cartan subalgebra, and {Uj\ is the dual basis, and Tj is a sign (+1 or -1). 

The formal infinite sum which plays the part of the /^-matrix has a well-defined action on the tensor product of two 

irreducible highest weight modules, and also on the tensor product if two lowest weight modules. Specifically, if v 

has weight a and w has weight /3 , then g 7 ?^ *Aj ^ y ^ w ^ _ qVfaP) v ^ w , and the fact that the 

modules are both highest weight modules or both lowest weight modules reduces the action of the other factor on 
v & w to a finite sum. 

Specifically, if V is a highest weight module, then the formal infinite sum, R, has a well-defined, and invertible, 
action on V ®V> and this value of R (as an element of Hom(l^) ® Hom(l^)) satisfies the Yang-Baxter 
equation, and therefore allows us to determine a representation of the braid group, and to define quasi-invariants for 
knots, links and braids. 

Quantum groups at q - 0 

Masaki Kashiwara has researched the limiting behaviour of quantum groups as q — > 0 . 

As a consequence of the defining relations for the quantum group U q (G) , U q (G) can be regarded as a Hopf 
algebra over Q(g) , the field of all rational functions of an indeterminate q over Q . 

For simple root o^and non-negative integer n , define g( n ) — e™ /[n] q . ! an d fj^ = f\p\ q .\ (specifically, 

= fj® = 1). In an integrable module M » an d for weight A , a vector u £ M\ (i.e. a vector u in M 

with weight \ ) can be uniquely decomposed into the sums 
oo oo 

where u n £ ker(e^) H M\+ noii , v n £ ker(/j) D M\- na , u n ^ Oonly if rc + Q ^ > Q, an d 

v n 7^ 0 only if n — 7^—^ — ^ > 0 . Linear mappings &i : M —> M and f . • M — > M can be defined on 
M A by 

OO OO 

r(n-l) (n+1) 



n=l n=0 
oc 00 



71=0 71=1 

Let A be the integral domain of all rational functions in Q(g) which are regular at q = 0(Le. a. rational function 
/(q)is an element of A if an d only if there exist polynomials g(q) and h(q) in the polynomial ring Q[g] such 

that h(0) 7^ 0 , and /(g) = g(q) /h(q) ). A crystal base for M is an ordered pair (L, B) , such that 

• L is a free -submodule of M such that M = <8U 

• B is a Q -basis of the vector space L/qL over Q 5 

• L = @\L\ and B = U X B X , where L A = L H M A and 5 A = 5 n (L A /gL A ) 5 

• e { L C L and c L for all i, 



Quantum group 



105 



• e { B C B U {0} and f { BcBU {0} for all i, 

• for all b € B and b ! G £?, and for all z, = b f if and only if fib* = 6. 

To put this into a more informal setting, the actions of e^/, and are generally singular at 5 = Oon an 
integrable module M • The linear mappings and f. on the module are introduced so that the actions of 
and J^e^ are regular at g = Oon the module. There exists a Q(g) -basis of weight vectors £ for M , with 
respect to which the actions of and ^ are regular at q = 0 for all /. The module is then restricted to the free A 
-module generated by the basis, and the basis vectors, the A -submodule and the actions of and f. are evaluated 
at 5 = 0. Furthermore, the basis can be chosen such that at q — 0 , for all % , and f { are represented by 

edges. Each vertex of the graph represents an 
element of the Q -basis B of L/qL , and a directed edge, labelled by /, and directed from vertex v ito vertex 
V2, represents that fr 2 = ^^(and, equivalently, that b\ — €$2), where fe 1 is the basis element represented by 
Vl, and 6 2 i s me basis element represented by V2. The graph completely determines the actions of and at 
q = 0 . If an integrable module has a crystal base, then the module is irreducible if and only if the graph 
representing the crystal base is connected (a graph is called "connected" if the set of vertices cannot be partitioned 
into the union of nontrivial disjoint subsets l^and V^such that there are no edges joining any vertex in Vjto any 
vertex in V2). 

For any integrable module with a crystal base, the weight spectrum for the crystal base is the same as the weight 
spectrum for the module, and therefore the weight spectrum for the crystal base is the same as the weight spectrum 
for the corresponding module of the appropriate Kac-Moody algebra. The multiplicities of the weights in the crystal 
base are also the same as their multiplicities in the corresponding module of the appropriate Kac-Moody algebra. 

It is a theorem of Kashiwara that every integrable highest weight module has a crystal base. Similarly, every 
integrable lowest weight module has a crystal base. 

Tensor products of crystal bases 

Let Afbe an integrable module with crystal base (L, B) and TVf'be an integrable module with crystal base 
(Z/, B f ). For crystal bases, the coproduct A> given by 

A(fc A ) = k x ® fc A , Afe) = e { <g> kr 1 + 1 ® e h A(fi) = f { ® 1 + k { ® f { , is adopted. The integrable 
module M ®Q( q ) M f has crystal base (L ®^ L* \B <g> B') , where 

B <g> B* = {b ®q b f : b € B, b f € B r } . For a basis vector b G B , define 
6i(b) = max{n > 0 : e"b ^ 0} and 0.(6) = max{n > 0 : f?b ^ 0} • The actions of ^and £ on 

^ b9b )-\bi»S i V i if 0 i (6)<e t (V) l 



W6 ® 6J -\6®/^ if 0,(6) < 6,(6')- 



The decomposition of the product two integrable highest weight modules into irreducible submodules is determined 
by the decomposition of the graph of the crystal base into its connected components (i.e. the highest weights of the 
submodules are determined, and the multiplicity of each highest weight is determined). 
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Compact matrix quantum groups 

See also compact quantum group. 

S.L. Woronowicz introduced compact matrix quantum groups. Compact matrix quantum groups are abstract 
structures on which the "continuous functions" on the structure are given by elements of a C* -algebra. The geometry 
of a compact matrix quantum group is a special case of a noncommutative geometry. 

The continuous complex-valued functions on a compact Hausdorff topological space form a commutative 
C*-algebra. By the Gelfand theorem, a commutative C*-algebra is isomorphic to the C*-algebra of continuous 
complex-valued functions on a compact Hausdorff topological space, and the topological space is uniquely 
determined by the C*-algebra up to homeomorphism. 

For a compact topological group, G, there exists a C*-algebra homomorphism A : G(G) — > G(G) ® G(G) 
(where G(G) ® (7(G) is the C*-algebra tensor product - the completion of the algebraic tensor product of G(G) 
and G(G)), such that A(/)(x ? y) = f{xy) for all / £ G(G), and for all x, y G G (where 
(/ ® 9){ x ? y) — f( x )9{y)^ or a ^ f->9^ G(G) and all x : y £ G ). There also exists a linear multiplicative 
mapping k : G(G) G(G) , such that K (f) ( x ) = f{x~ l ) for all / £ G(G) and all x <E G • Strictly, this 
does not make G(G) a Hopf algebra, unless G is finite. On the other hand, a finite-dimensional representation of G 
can be used to generate a *-subalgebra of G(G) which is also a Hopf *-algebra. Specifically, if ^ i — > {v>ij{g))ij 
is an n -dimensional representation of G > then ^ij £ G(G) for all i, J , and = 5^ w * fc ® ^fejfor all 

i, j . It follows that the *-algebra generated by ^ijfor all i, j and ^(li^for all 2, j is a Hopf *-algebra: the 
counit is determined by e{u{j) = 5^- for all i, j (where (5^- is the Kronecker delta), the antipode is ft , and the 

Mttahb^Wiy.-^^^ - a pair (C, .) , where C is a C-a lg ee,a and 
^ = (l^j*)^ n is a matrix with entries in (7 such that 

• The *-subalgebra, Gq, of (7 » which is generated by the matrix elements of w , is dense in (7 j 

• There exists a C*-algebra homomorphism A : G — > C ® C (where G ® C is the C*-algebra tensor 
product - the completion of the algebraic tensor product of C and G ) such that 

A (^j) = Y^ u ^® u kj 
k 

for all i, j ( A is called the ^multiplication); 

• There exists a linear antimultiplicative map k : Gq — > Gq (the coinverse) such that «(ft(x;*)*) = for all 
u (E Gq and 5^ K ^} l ik) u kj — u ik^{ u kj) — where / is the identity element of C • Since K is 

A; 

antimultiplicative, then k(vw) = ft(tL?)ft(?;) for all v^w G Gq. 
As a consequence of continuity, the comultiplication on G is coassociative. 

In general, G is n °t a bialgebra, and Gois a Hopf *-algebra. 

Informally, G can be regarded as the *-algebra of continuous complex- valued functions over the compact matrix 
quantum group, and u can be regarded as a finite-dimensional representation of the compact matrix quantum group. 
A representation of the compact matrix quantum group is given by a corepresentation of the Hopf * -algebra (a 
corepresentation of a counital coassociative coalgebra A is a square matrix v = (vij)i : j=i jm .. :7l with entries in A 

71 

(so v £ M n (A) ) such that A{v{j) = ^ ® v^jfor all i, j and e(vij) = 5{j for all i, j ). Furthermore, 

fc=l 

a representation v, is called unitary if the matrix for v is unitary (or equivalently, if K>{vij) — Vji for all ij). 
An example of a compact matrix quantum group is S 11^(2) , where the parameter ft is a positive real number. So 
SU^{2) = (G(S , [/ /i (2), it), where C{SU^ (2)) is the C*-algebra generated by a and 7,subjectto 
77* — 7*7? a 7 — AH^i a 7* = / i 7* a ? aa * + ^7*7 = a * a + / i_1 7*7 — ^> 
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and w = 



( OL T \ 

and u = I ^ # 1 , so that the comultiplication is determined by A (a) = a. ® a. — 7 £5 7*, 

A(7) = a®7 + 7®a*, and the coinverse is determined by k{o) = a*, ^(7) = — A i_1 7' 
^(7*) = —fij* , = Q . Note that u is a representation, but not a unitary representation, w is equivalent 

to the unitary representation v = I * * 1 ■ 

Equivalently, = (0(517^(2)), where C(5C/ /z (2))is the C*-algebra generated by a and j3 

, subject to 

/3fi* = /?*/?, a/3 = fi/3a, a{3* = fif3*a, aa* + (i 2 p*P = a*a + (3* (3 = I, 

^* ,S0 ^ at ^ e comu l t iP^ cat i° n i s determined by A (a) = a ® a — fi/3 ® /?*, 

A(/3) = a®/3 + /?®a*, and the coinverse is determined by k{ql) = a*, = — 

K,(j3*) = —flft* , = a . Note that luis a unitary representation. The realizations can be identified by 

equating 7 = \fji<(3 • 

When fj, = 1, then S'[/ /X (2)is equal to the algebra C(iS'C/(2))of functions on the concrete compact group 

517(2). 

Bicrossproduct quantum groups 

Whereas compact matrix pseudogroups are typically versions of Drinfeld-Jimbo quantum groups in a dual function 
algebra formulation, with additional structure, the bicrossproduct ones are a distinct second family of quantum 
groups of increasing importance as deformations of solvable rather than semisimple Lie groups. They are associated 
to Lie splittings of Lie algebras or local factorisations of Lie groups and can be viewed as the cross product or 
Mackey quantisation of one of the factors acting on the other for the algebra and a similar story for the coproduct A 
with the second factor acting back on the first. The very simplest nontrivial example corresponds to two copies of 
locally acting on each other and results in a quantum group (given here in an algebraic form) with generators 
p, K : K —1 > say, and coproduct 

[p, K] = hK(K -l),Ap = p®K + l®p,AK = K®K 

where h is me deformation parameter. This quantum group was linked to a toy model of Planck scale physics 
implementing Born reciprocity when viewed as a deformation of the Heisenberg algebra of quantum mechanics. 
Also, starting with any compact real form of a semisimple Lie algebra 9 its complexification as a real Lie algebra of 
twice the dimension splits into 9 and a certain solvable Lie algebra (the Iwasawa decomposition), and this provides 
a canonical bicrossproduct quantum group associated to 9 • For su(2) one obtains a quantum group deformation of 
the Euclidean group E(3) of motions in 3 dimensions. 



Notes 

[1] Schwiebert, Christian (1994), Generalized quantum inverse scattering, arXiv:hep-th/9412237v3 
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Affine quantum group 

In mathematics, a quantum affine algebra (or affine quantum group) is a Hopf algebra that is a ^-deformation of 
the universal enveloping algebra of an affine Lie algebra. They were introduced independently by Drinfeld (1985) 
and Jimbo (1985) as a special case of their general construction of a quantum group from a Cartan matrix. One of 
their principal applications has been to the theory of solvable lattice models in quantum statistical mechanics, where 
the Yang-Baxter equation occurs with a spectral parameter. Combinatorial aspects of the representation theory of 
quantum affine algebras can be described simply using crystal bases, which correspond to the degenerate case when 
the deformation parameter q vanishes and the Hamiltonian of the associated lattice model can be explicitly 
diagonalized. 
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Affine Lie algebra 

In mathematics, an affine Lie algebra is an infinite-dimensional Lie algebra that is constructed in a canonical 
fashion out of a finite-dimensional simple Lie algebra. It is a Kac-Moody algebra whose generalized Cartan matrix 
is positive semi-definite and has corank 1. From purely mathematical point of view, affine Lie algebras are 
interesting because their representation theory, like representation theory of finite dimensional, semisimple Lie 
algebras is much better understood than that of general Kac-Moody algebras. As observed by Victor Kac, the 
character formula for representations of affine Lie algebras implies certain combinatorial identities, the Macdonald 
identities. 

Affine Lie algebras play an important role in string theory and conformal field theory due to the way they are 
constructed: starting from a simple Lie algebra 0, one considers the loop algebra, Lg, formed by the 0 -valued 
functions on a circle (interpreted as the closed string) with pointwise commutator. The affine Lie algebra gis 
obtained by adding one extra dimension to the loop algebra and modifying a commutator in a non-trivial way, which 
physicists call a quantum anomaly. The point of view of string theory helps to understand many deep properties of 
affine Lie algebras, such as the fact that the characters of their representations are given by modular forms. 

Affine Lie algebras from simple Lie algebras 
Construction 

If 0is a finite dimensional simple Lie algebra, the corresponding affine Lie algebra gis constructed as a central 
extension of the infinite-dimensional Lie algebra g (g) C[t, t -1 ] , with one-dimensional center Cc. As a vector 
space, 

0 = A ® C[M _1 ] © Cc, 

where C[f, £ _1 ]is the complex vector space of Laurent polynomials in the indeterminate t. The Lie bracket is 
defined by the formula 

[a ® t n + ac, b®t rn + (3c} = [a, b] <g> i n+m + {a^nS^^c 
for all a, b <E 0, a, (3 G C and n, m £ Z , where [a, b] is the Lie bracket in the Lie algebra 0and (-| ■) is the 
Cartan-Killing form on 0. 

The affine Lie algebra corresponding to a finite-dimensional semisimple Lie algebra is the direct sum of the affine 
Lie algebras corresponding to its simple summands. 

Constructing the Dynkin diagrams 

The Dynkin diagram of each affine Lie algebra consists of that of the corresponding simple Lie algebra plus an 
additional node, which corresponds to the addition of an imaginary root. Of course, such a node cannot be attached 
to the Dynkin diagram in just any location, but for each simple Lie algebra there exists a number of possible 
attachments equal to the cardinality of the group of outer automorphisms of the Lie algebra. In particular, this group 
always contains the identity element, and the corresponding affine Lie algebra is called an untwisted affine Lie 
algebra. When the simple algebra admits automorphisms that are not inner automorphisms, one may obtain other 
Dynkin diagrams and these correspond to twisted affine Lie algebras. 
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Classifying the central extensions 

The attachment of an extra node to the Dynkin diagram of the corresponding simple Lie algebra corresponds to the 
following construction. An affine Lie algebra can always be constructed as a central extension of the loop algebra of 
the corresponding simple Lie algebra. If one wishes to begin instead with a semisimple Lie algebra, then one needs 
to centrally extend by a number of elements equal to the number of simple components of the semisimple algebra. In 
physics, one often considers instead the direct sum of a semisimple algebra and an abelian algebra . In this case 
one also needs to add n further central elements for the n abelian generators. 

The second integral cohomology of the loop group of the corresponding simple compact Lie group is isomorphic to 
the integers. Central extensions of the affine Lie group by a single generator are topologically circle bundles over 
this free loop group, which are classified by a two-class known as the first Chern class of the fibration. Therefore the 
central extensions of an affine Lie group are classified by a single parameter k which is called the central charge in 
the physics literature, where it first appeared. Unitary highest weight representations of the affine compact groups 
only exist when k is a natural number. More generally, if one considers a semi- simple algebra, there is a central 
charge for each simple component. 

Applications 

They appear naturally in theoretical physics (for example, in conformal field theories such as the WZW model and 
coset models and even on the worldsheet of the heterotic string), geometry, and elsewhere in mathematics. 
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Quantum affine algebra 

In mathematics, a quantum affine algebra (or affine quantum group) is a Hopf algebra that is a ^-deformation of 
the universal enveloping algebra of an affine Lie algebra. They were introduced independently by Drinfeld (1985) 
and Jimbo (1985) as a special case of their general construction of a quantum group from a Cartan matrix. One of 
their principal applications has been to the theory of solvable lattice models in quantum statistical mechanics, where 
the Yang-Baxter equation occurs with a spectral parameter. Combinatorial aspects of the representation theory of 
quantum affine algebras can be described simply using crystal bases, which correspond to the degenerate case when 
the deformation parameter q vanishes and the Hamiltonian of the associated lattice model can be explicitly 
diagonalized. 
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Operator algebra 

In functional analysis, an operator algebra is an algebra of continuous linear operators on a topological vector space 
with the multiplication given by the composition of mappings. Although it is usually classified as a branch of 
functional analysis, it has direct applications to representation theory, differential geometry, quantum statistical 
mechanics and quantum field theory. 

Such algebras can be used to study arbitrary sets of operators with little algebraic relation simultaneously. From this 
point of view, operator algebras can be regarded as a generalization of spectral theory of a single operator. In general 
operator algebras are non-commutative rings. 

An operator algebra is typically required to be closed in a specified operator topology inside the algebra of the whole 
continuous linear operators. In particular, it is a set of operators with both algebraic and topological closure 
properties. In some disciplines such properties are axiomized and algebras with certain topological structure become 
the subject of the research. 

Though algebras of operators are studied in various contexts (for example, algebras of pseudo-differential operators 
acting on spaces of distributions), the term operator algebra is usually used in reference to algebras of bounded 
operators on a Banach space or, even more specially in reference to algebras of operators on a separable Hilbert 
space, endowed with the operator norm topology. 

In the case of operators on a Hilbert space, the adjoint map on operators gives a natural involution which provides an 
additional algebraic structure which can be imposed on the algebra. In this context, the best studied examples are 
self-adjoint operator algebras, meaning that they are closed under taking adjoints. These include C*-algebras and von 
Neumann algebras. C*-algebras can be easily characterized abstractly by a condition relating the norm, involution 
and multiplication. Such abstractly defined C* -algebras can be identified to a certain closed subalgebra of the 
algebra of the continuous linear operators on a suitable Hilbert space. A similar result holds for von Neumann 
algebras. 

Commutative self-adjoint operator algebras can be regarded as the algebra of complex valued continuous functions 
on a locally compact space, or that of measurable functions on a standard measurable space. Thus, general operator 
algebras are often regarded as a noncommutative generalizations of these algebras, or the structure of the base space 
on which the functions are defined. This point of view is elaborated as the philosophy of noncommutative geometry, 
which tries to study various non-classical and/or pathological objects by noncommutative operator algebras. 

Examples of operator algebras which are not self-adjoint include: 

• nest algebras 

• many commutative subspace lattice algebras 

• many limit algebras 
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Clifford algebra 

In mathematics, Clifford algebras are a type of associative algebra. They can be thought of as one of the possible 
generalizations of the complex numbers and quaternions. The theory of Clifford algebras is intimately connected 
with the theory of quadratic forms and orthogonal transformations. Clifford algebras have important applications in a 
variety of fields including geometry and theoretical physics. They are named after the English geometer William 
Kingdon Clifford. 

Introduction and basic properties 

Specifically, a Clifford algebra is a unital associative algebra which contains and is generated by a vector space V 
equipped with a quadratic form Q. The Clifford algebra C£(V,Q) is the "freest" algebra generated by V subject to the 
condition^ 

v 2 = Q(v)l for all vtV. 

If the characteristic of the ground field K is not 2, then one can rewrite this fundamental identity in the form 

uv + vu = 2(ii 5 v) for all v € V, 

where <u, v> = l /2(Q(u + v) - Q(u) - Q(v)) is the symmetric bilinear form associated to Q, via the polarization 
identity. The idea of being the "freest" or "most general" algebra subject to this identity can be formally expressed 
through the notion of a universal property, as done below. 

Quadratic forms and Clifford algebras in characteristic 2 form an exceptional case. In particular, if char K = 2 it is 
not true that a quadratic form determines a symmetric bilinear form, or that every quadratic form admits an 
orthogonal basis. Many of the statements in this article include the condition that the characteristic is not 2, and are 
false if this condition is removed. 

As quantization of exterior algebra 

Clifford algebras are closely related to exterior algebras. In fact, if Q = 0 then the Clifford algebra C£(V,Q) is just the 
exterior algebra A(V). For nonzero Q there exists a canonical linear isomorphism between A(V) and C£(V,Q) 
whenever the ground field K does not have characteristic two. That is, they are naturally isomorphic as vector spaces, 
but with different multiplications (in the case of characteristic two, they are still isomorphic as vector spaces, just not 
naturally). Clifford multiplication is strictly richer than the exterior product since it makes use of the extra 
information provided by Q. 

More precisely, Clifford algebras may be thought of as quantizations (cf. quantization (physics), Quantum group) of 
the exterior algebra, in the same way that the Weyl algebra is a quantization of the symmetric algebra. 

Weyl algebras and Clifford algebras admit a further structure of a *-algebra, and can be unified as even and odd 
terms of a superalgebra, as discussed in CCR and CAR algebras. 

Universal property and construction 

Let V be a vector space over a field K, and let Q : V —> K be a quadratic form on V. In most cases of interest the field 
K is either R, C or a finite field. 

A Clifford algebra C£(V,Q) is a unital associative algebra over K together with a linear map i : V —> C£{V,Q) 

2 

satisfying i(v) = Q(v)l for all v E V, defined by the following universal property: Given any associative algebra A 
over K and any linear map j : V —> A such that 

j( v ) 2 =Q(v)l for alive V 
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(where 1 denotes the multiplicative identity of A), there is a unique algebra homomorphism / : C£(V,Q) —> A such 
that the following diagram commutes (i.e. such that/o i = j): 

v — l -+ct(y,Q) 




* 



A 

Working with a symmetric bilinear form < , > instead of Q (in characteristic not 2), the requirement on j is 

j(v)j(w) + j(w)j(v) = 2<v, w> for all v, w £ V. 

A Clifford algebra as described above always exists and can be constructed as follows: start with the most general 
algebra that contains V, namely the tensor algebra T(V), and then enforce the fundamental identity by taking a 
suitable quotient. In our case we want to take the two-sided ideal / in T(V) generated by all elements of the form 

v ® v - Q(v)lfor all y £ V 

and define C£(V,Q) as the quotient 
C£(V,Q) = T(V)/I Q 

It is then straightforward to show that C£(V,Q) contains V and satisfies the above universal property, so that CI is 
unique up to a unique isomorphism; thus one speaks of "the" Clifford algebra C€(V, Q). It also follows from this 
construction that i is injective. One usually drops the i and considers V as a linear subspace of C£(V,Q). 

The universal characterization of the Clifford algebra shows that the construction of C£(V,Q) is functorial in nature. 
Namely, C£ can be considered as a functor from the category of vector spaces with quadratic forms (whose 
morphisms are linear maps preserving the quadratic form) to the category of associative algebras. The universal 
property guarantees that linear maps between vector spaces (preserving the quadratic form) extend uniquely to 
algebra homomorphisms between the associated Clifford algebras. 



Basis and dimension 

If the dimension of V is n and { , . . . ,e } is a basis of V, then the set 

1 n 

{e^e^ - -e ik | 1 < t x < i 2 < ■ ■ - < ik < n and 0 < k < n} 

is a basis for C£(V,Q). The empty product (k = 0) is defined as the multiplicative identity element. For each value of 
k there are n choose k basis elements, so the total dimension of the Clifford algebra is 

dimC*(KQ) = £Q =T. 

Since V comes equipped with a quadratic form, there is a set of privileged bases for V: the orthogonal ones. An 
orthogonal basis is one such that 

(e;,^) =0 i^j. 

where <•,•> is the symmetric bilinear form associated to Q. The fundamental Clifford identity implies that for an 
orthogonal basis 

This makes manipulation of orthogonal basis vectors quite simple. Given a product e^e^ • • • e^ fc of distinct 
orthogonal basis vectors, one can put them into standard order by including an overall sign corresponding to the 
number of flips needed to correctly order them (i.e. the signature of the ordering permutation). 

If the characteristic is not 2 then an orthogonal basis for V exists, and one can easily extend the quadratic form on V 
to a quadratic form on all of C£(V,Q) by requiring that distinct elements e^e^ • * * e^ fc are orthogonal to one another 
whenever the {e.}'s are orthogonal. Additionally, one sets 
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Q{e h e i2 • • -e ifc ) = Q{e il )Q{e i2 ) • -Q(e ik ). 



2 

The quadratic form on a scalar is just Q(X) = X . Thus, orthogonal bases for V extend to orthogonal bases for 
C£(V,<2). The quadratic form defined in this way is actually independent of the orthogonal basis chosen (a 
basis-independent formulation will be given later). 

Examples: real and complex Clifford algebras 

The most important Clifford algebras are those over real and complex vector spaces equipped with nondegenerate 
quadratic forms. The geometric interpretation of nondegenerate Clifford algebras is known as geometric algebra. 

Every nondegenerate quadratic form on a finite-dimensional real vector space is equivalent to the standard diagonal 
form: 

Q{v)=vl + --- + vl-vl +l v 2 p+g 

where n = p + q is the dimension of the vector space. The pair of integers (/?, q) is called the signature of the 
quadratic form. The real vector space with this quadratic form is often denoted R p,q . The Clifford algebra on R p,q is 
denoted C€ p ^(R). The symbol CiJJL) means either Cl^ Q (R) or C£ Q ^(R) depending on whether the author prefers 
positive definite or negative definite spaces. 

A standard orthonormal basis {e.} for R p,q consists of n = p + q mutually orthogonal vectors, p of which have norm 
+1 and q of which have norm -1. The algebra C^^(R) will therefore have p vectors which square to +1 and q 
vectors which square to -1. 

Note that C^ 0Q (R) is naturally isomorphic to R since there are no nonzero vectors. Cl^ ^R) is a two-dimensional 
algebra generated by a single vector which squares to -1, and therefore is isomorphic to C, the field of complex 
numbers. The algebra C£ Q2 (R) is a four-dimensional algebra spanned by {1, e^e^}. The latter three elements 

square to -1 and all anticommute, and so the algebra is isomorphic to the quaternions H. The next algebra in the 
sequence is Cl^ 3 (R) is an 8 -dimensional algebra isomorphic to the direct sum H © H called split-biquaternions. 

One can also study Clifford algebras on complex vector spaces. Every nondegenerate quadratic form on a complex 
vector space is equivalent to the standard diagonal form 

Q(z) = zl + 4 + --- + zl 

where n = dim V, so there is essentially only one Clifford algebra in each dimension. We will denote the Clifford 
algebra on C n with the standard quadratic form by C^(C). One can show that the algebra Ci^iC) m ay be obtained as 
the complexification of the algebra C€ p ^(R) where n=p + q: 

ci n {c) = ce Ptq (R) <g> c = ce{c p + q , q ® c). 

Here Q is the real quadratic form of signature (p,q). Note that the complexification does not depend on the signature. 
The first few cases are not hard to compute. One finds that 

C^ 0 (C) = C 

C^(C) = c © c 

C£ 2 (C) = M 2 (C) 

where M (C) denotes the algebra of 2x2 matrices over C. 

It turns out that every one of the algebras CI (R) and C^(C) is isomorphic to a matrix algebra over R, C, or H or 
to a direct sum of two such algebras. For a complete classification of these algebras see classification of Clifford 
algebras. 
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Properties 

Relation to the exterior algebra 

Given a vector space V one can construct the exterior algebra A(V), whose definition is independent of any quadratic 
form on V. It turns out that if F does not have characteristic 2 then there is a natural isomorphism between A(V) and 
C£(V,Q) considered as vector spaces (and there exists an isomorphism in characteristic two, which may not be 
natural). This is an algebra isomorphism if and only if Q = 0. One can thus consider the Clifford algebra C£(V,Q) as 
an enrichment (or more precisely, a quantization, cf. the Introduction) of the exterior algebra on V with a 
multiplication that depends on Q (one can still define the exterior product independent of Q). 

The easiest way to establish the isomorphism is to choose an orthogonal basis {e.} for V and extend it to an 
orthogonal basis for C£(V,Q) as described above. The map C£(V,Q) —> A(V) is determined by 
e h e i2 ■••e^e il Ae, 2 A---A e ik . 

Note that this only works if the basis {e.} is orthogonal. One can show that this map is independent of the choice of 
orthogonal basis and so gives a natural isomorphism. 

If the characteristic of K is 0, one can also establish the isomorphism by antisymmetrizing. Define functions : V x 
... xV^C£(V,g)by 

fk(v u ' ' ' ,v k ) = ^ J2 s S n ( a ) Mi) * * * M*) 

" <rES k 

where the sum is taken over the symmetric group on k elements. Since/ is alternating it induces a unique linear map 

k 

A (V) —> C€(V,Q). The direct sum of these maps gives a linear map between A(V) and C£(V,Q). This map can be 
shown to be a linear isomorphism, and it is natural. 

A more sophisticated way to view the relationship is to construct a filtration on C£(V,Q). Recall that the tensor 
algebra T(V) has a natural filtration: F® C F l C F 2 C ... where F k contains sums of tensors with rank < k. Projecting 
this down to the Clifford algebra gives a filtration on C£{V,Q). The associated graded algebra 

Gr F C£(V,Q) = ®F k /F k - 1 
k 

is naturally isomorphic to the exterior algebra A(V). Since the associated graded algebra of a filtered algebra is 
always isomorphic to the filtered algebra as filtered vector spaces (by choosing complements of F k inF k+l for all k), 
this provides an isomorphism (although not a natural one) in any characteristic, even two. 

Grading 

In the following, assume that the characteristic is not 2. 

Clifford algebras are Z 2 -graded algebras (also known as superalgebras). Indeed, the linear map on V defined by 
V — v (reflection through the origin) preserves the quadratic form Q and so by the universal property of Clifford 
algebras extends to an algebra automorphism 

a : C€(V,Q) -> C€(V,Q). 

Since a is an involution (i.e. it squares to the identity) one can decompose C£(V,Q) into positive and negative 
eigenspaces 

Cl{V t Q) = C£°(V, Q) e C£\V, Q) 

where C£ l (V,Q) ={xE C£(V,Q) I a(x) = (-1) l x}. Since a is an automorphism it follows that 

CV{V, Q)C£ j {V, Q) = C£ i+j {V, Q) 
where the superscripts are read modulo 2. This gives Ct(V,Q) the structure of a Z 2 -graded algebra. The subspace 

Q) forms a subalgebra of C£(V,<2), called the even subalgebra. The subspace CI (V 9 Q) is called the odd part 
of C£(V,Q) (it is not a subalgebra). The Z 2 -grading plays an important role in the analysis and application of Clifford 
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algebras. The automorphism a is called the main involution or grade involution. 

Remark. In characteristic not 2 the underlying vector space of C£(V,Q) inherits a Z-grading from the canonical 
isomorphism with the underlying vector space of the exterior algebra A(V). It is important to note, however, that this 
is a vector space grading only. That is, Clifford multiplication does not respect the Z-grading, only the Z-grading: 
for instance if Q(y) ^ 0, then y £ C^(V,Q), but y 2 <= C£°(V : Q), not in C£ 2 (V 5 Q). Happily, the 
gradings are related in the natural way: Z 2 = Z/2Z. Further, the Clifford algebra is Z-filtered: 
Ci-^V, Q) • C£- j (V : Q) C Cl- i+j (V, Q) • The degree of a Clifford number usually refers to the degree in 
the Z-grading. Elements which are pure in the Z-grading are simply said to be even or odd. 

The even subalgebra C£°(V,Q) of a Clifford algebra is itself a Clifford algebra^ . If Vis the orthogonal direct sum of 
a vector a of norm Q(a) and a subspace U, then C£°(V,Q) is isomorphic to C£{U-Q{a)Q), where -Q(a)Q is the form 
Q restricted to U and multiplied by -Q{a). In particular over the reals this implies that 

C£° p q (R) = C£ Pjg _i(M) for q > 0, and 

Cf p q {R) ^ C^, p _!(R) for p > 0. 
In the negative-definite case this gives an inclusion C€ n „ (R) C C€ n (R) which extends the sequence 

0,n—l U, n 

RC CCHCH0HC ... 

Likewise, in the complex case, one can show that the even subalgebra of C€ (C) is isomorphic to C€ (C). 
Antiautomorphisms 

In addition to the automorphism a, there are two antiautomorphisms which play an important role in the analysis of 
Clifford algebras. Recall that the tensor algebra T(V) comes with an antiautomorphism that reverses the order in all 
products: 

Vl ® v 2 <8> • • • ® v k h-> v k <g) - - - <g) v 2 <8> Vi- 
Since the ideal / is invariant under this reversal, this operation descends to an antiautomorphism of C£(V,Q) called 
the transpose or reversal operation, denoted by x. The transpose is an antiautomorphism: (xy) 1 = y t x t • The 
transpose operation makes no use of the Z-grading so we define a second antiautomorphism by composing a and 
the transpose. We call this operation Clifford conjugation denoted x 

x = a{x l ) = a(xY. 

Mi 

Of the two antiautomorphisms, the transpose is the more fundamental. 

Note that all of these operations are involutions. One can show that they act as ±1 on elements which are pure in the 
Z-grading. In fact, all three operations depend only on the degree modulo 4. That is, if x is pure with degree k then 

a(x) = ±x x l = ±x x = ±x 
where the signs are given by the following table: 



k mod 4 


0 


1 


2 


3 




a(x) 


+ 




+ 




(-1)" 




+ 


+ 






^_^k(k-l)/2 


X 


+ 






+ 


,_^k(k+l)/2 
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The Clifford scalar product 

When the characteristic is not 2 the quadratic form Q on V can be extended to a quadratic form on all of C£(V,Q) as 
explained earlier (which we also denoted by Q). A basis independent definition is 

Q{x) = {x*x) 

where <a> denotes the scalar part of a (the grade 0 part in the Z-grading). One can show that 

Q(viv 2 "'Vk) = Q{vi)Q{v 2 ) • • • Q(v k ) 

where the v. are elements of V — this identity is not true for arbitrary elements of C£(V,Q). 

The associated symmetric bilinear form on C^(V,g) is given by 
(x,y) = (xty. 

One can check that this reduces to the original bilinear form when restricted to V. The bilinear form on all of 
C£(V,Q) is nondegenerate if and only if it is nondegenerate on V. 

It is not hard to verify that the transpose is the adjoint of left/right Clifford multiplication with respect to this inner 
product. That is, 

(ax,y) = (z, a*y),and 
(xa,y) = {x.ya 1 ). 

Structure of Clifford algebras 

In this section we assume that the vector space V is finite dimensional and that the bilinear form of Q is non-singular. 
A central simple algebra over K is a matrix algebra over a (finite dimensional) division algebra with center K. For 
example, the central simple algebras over the reals are matrix algebras over either the reals or the quaternions. 

• If V has even dimension then C£(V,Q) is a central simple algebra over K. 

• If V has even dimension then C£°(V,Q) is a central simple algebra over a quadratic extension of K or a sum of two 
isomorphic central simple algebras over K. 

• If V has odd dimension then C£(V,Q) is a central simple algebra over a quadratic extension of K or a sum of two 
isomorphic central simple algebras over K. 

• If V has odd dimension then C£°(V,Q) is a central simple algebra over K. 

The structure of Clifford algebras can be worked out explicitly using the following result. Suppose that U has even 
dimension and a non- singular bilinear form with discriminant d, and suppose that V is another vector space with a 
quadratic form. The Clifford algebra of U+V is isomorphic to the tensor product of the Clifford algebras of U and 
(-l) dim ^ /2 JV, which is the space V with its quadratic form multiplied by (-l) dim ^ /2 J. Over the reals, this implies 
in particular that 

Cl p+2 , q (R) = M 2 (E) <g> CZ, iP (R) 
Cl p+1 , q+1 {R) = M 2 (R) ® Cl p<q {R) 
Cl Piq+2 (R)=W®Cl qtP (R). 

These formulas can be used to find the structure of all real Clifford algebras; see the classification of Clifford 
algebras. 

Notably, the Morita equivalence class of a Clifford algebra (its representation theory: the equivalence class of the 
category of modules over it) depends only on the signature p — qmod 8. This is an algebraic form of Bott 
periodicity. 



Clifford algebra 



119 



The Clifford group T 

In this section we assume that V is finite dimensional and the quadratic form Q is nondegenerate. 

The invertible elements of the Clifford algebra act on it by twisted conjugation: conjugation by x maps 

y i— » xya(x) -1 . 

The Clifford group T is defined to be the set of invertible elements x that stabilize vectors, meaning that 

xva{x)~ l G V 
for all v in V. 

This formula also defines an action of the Clifford group on the vector space V that preserves the norm Q, and so 
gives a homomorphism from the Clifford group to the orthogonal group. The Clifford group contains all elements r 
of V of nonzero norm, and these act on V by the corresponding reflections that take v to v - <v,r>r/Q(r) (In 
characteristic 2 these are called orthogonal trans vections rather than reflections.) 

The Clifford group T is the disjoint union of two subsets T° and T 1 , where T l is the subset of elements of degree /. 
The subset T° is a subgroup of index 2 in T. 

If V is a finite dimensional real vector space with positive definite (or negative definite) quadratic form then the 
Clifford group maps onto the orthogonal group of V with respect to the form (by the Cartan-Dieudonne theorem) and 
the kernel consists of the nonzero elements of the field K. This leads to exact sequences 

i->jp _>r->o v (jiQ->i 3 
1 -> k* -> r° -> SO v {K) -> 1. 

Over other fields or with indefinite forms, the map is not in general onto, and the failure is captured by the spinor 
norm. 

Spinor norm 

In arbitrary characteristic, the spinor norm Q is defined on the Clifford group by 
Q(x) = x l x. 

It is a homomorphism from the Clifford group to the group K of non-zero elements of K. It coincides with the 
quadratic form Q of V when V is identified with a subspace of the Clifford algebra. Several authors define the spinor 
norm slightly differently, so that it differs from the one here by a factor of -1, 2, or -2 on T 1 . The difference is not 
very important in characteristic other than 2. 

The nonzero elements of K have spinor norm in the group K 2 of squares of nonzero elements of the field K. So 
when V is finite dimensional and non-singular we get an induced map from the orthogonal group of V to the group 
KIK 2 , also called the spinor norm. The spinor norm of the reflection of a vector r has image Q(r) in K IK 2 , and 
this property uniquely defines it on the orthogonal group. This gives exact sequences: 

1 -> {±1} -> Pm v {K) -> O v {K) -> IT/IT 2 , 

1 -> {±1} -> Spin y (K) -> SO v {K) -> K*/K* 2 . 
Note that in characteristic 2 the group {±1 } has just one element. 

From the point of view of Galois cohomology of algebraic groups, the spinor norm is a connecting homomorphism 
on cohomology. Writing \x 2 for the algebraic group of square roots of 1 (over a field of characteristic not 2 it is 
roughly the same as a two-element group with trivial Galois action), the short exact sequence 

1 — > fjb 2 — > Piny — > O v —> 1 

yields a long exact sequence on cohomology, which begins 

1 -> i/°0i 2 ; if) -> #°(Pm y ; K) -» tf°(CV; tf) -> ^(WJ *0- 
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The 0th Galois cohomology group of an algebraic group with coefficients in K is just the group of K- valued points: 
H°(G] K) = G(K) , and i? 1 (/i 2 ; K) = K*/K* 2 , which recovers the previous sequence 

1 -> {±1} -> Pm v {K) -> Ov(ff) -> K*/K* 2 , 
where the spinor norm is the connecting homomorphism H°(Oy] K) — » i? 1 (/i2j K). 

Spin and Pin groups 

In this section we assume that V is finite dimensional and its bilinear form is non-singular. (If K has characteristic 2 
this implies that the dimension of Vis even.) 

The Pin group Pin^(K) is the subgroup of the Clifford group T of elements of spinor norm 1 , and similarly the Spin 
group Spiriy(K) is the subgroup of elements of Dickson invariant 0 in Pin^iK). When the characteristic is not 2, these 
are the elements of determinant 1. The Spin group usually has index 2 in the Pin group. 

Recall from the previous section that there is a homomorphism from the Clifford group onto the orthogonal group. 
We define the special orthogonal group to be the image of T 0 . If K does not have characteristic 2 this is just the 
group of elements of the orthogonal group of determinant 1. If K does have characteristic 2, then all elements of the 
orthogonal group have determinant 1 , and the special orthogonal group is the set of elements of Dickson invariant 0. 

There is a homomorphism from the Pin group to the orthogonal group. The image consists of the elements of spinor 
norm 1 £ K IK . The kernel consists of the elements +1 and -1, and has order 2 unless K has characteristic 2. 
Similarly there is a homomorphism from the Spin group to the special orthogonal group of V. 

In the common case when V is a positive or negative definite space over the reals, the spin group maps onto the 
special orthogonal group, and is simply connected when V has dimension at least 3. Further the kernel of this 
homomorphism consists of 1 and -l.So in this case the spin group, Spin(n), is a double cover of SO(n). Please note, 
however, that the simple connectedness of the spin group is not true in general: if V is R p,q for p and q both at least 2 
then the spin group is not simply connected. In this case the algebraic group Spin is simply connected as an 
algebraic group, even though its group of real valued points Spin (R) is not simply connected. This is a rather 
subtle point, which completely confused the authors of at least one standard book about spin groups. 

Spinors 

Clifford algebras CD (C), with p+q=2n even, are matrix algebras which have a complex representation of 

n ^ 

dimension 2 . By restricting to the group Pin^ (R) we get a complex representation of the Pin group of the same 
dimension, called the spin representation. If we restrict this to the spin group Spin (R) then it splits as the sum of 

n-l 

two half spin representations (or Weyl representations) of dimension 2 . 

If p+q=2n+l is odd then the Clifford algebra CD (C) is a sum of two matrix algebras, each of which has a 
representation of dimension 2 n , and these are also both representations of the Pin group Pin^ J(JZ). On restriction to 
the spin group Spin^^R) these become isomorphic, so the spin group has a complex spinor representation of 
dimension 2 n . 

More generally, spinor groups and pin groups over any field have similar representations whose exact structure 
depends on the structure of the corresponding Clifford algebras: whenever a Clifford algebra has a factor that is a 
matrix algebra over some division algebra, we get a corresponding representation of the pin and spin groups over 
that division algebra. For examples over the reals see the article on spinors. 
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Real spinors 

To describe the real spin representations, one must know how the spin group sits inside its Clifford algebra. The Pin 

group, Pin is the set of invertible elements in CI which can be written as a product of unit vectors: 
p.q p.q 

Pin P:q = {viv 2 . . . v r \ Vi ? || ^|| = ±1}. 

Comparing with the above concrete realizations of the Clifford algebras, the Pin group corresponds to the products 
of arbitrarily many reflections: it is a cover of the full orthogonal group 0(p,q). The Spin group consists of those 
elements of Pin which are products of an even number of unit vectors. Thus by the Cartan-Dieudonne theorem 

p,q 

Spin is a cover of the group of proper rotations SO(p,q). 

Let a : CI —> CI be the automorphism which is given by -Id acting on pure vectors. Then in particular, Spin^ ^ is the 
subgroup of Pin whose elements are fixed by a. Let 

PA 

Cl° Piq = {x€Cl p , q \a(x)=x}. 

(These are precisely the elements of even degree in CI .) Then the spin group lies within 

The irreducible representations of C£^ ^ restrict to give representations of the pin group. Conversely, since the pin 
group is generated by unit vectors, all of its irreducible representation are induced in this manner. Thus the two 
representations coincide. For the same reasons, the irreducible representations of the spin coincide with the 
irreducible representations of 

PA 

To classify the pin representations, one need only appeal to the classification of Clifford algebras. To find the spin 
representations (which are representations of the even subalgebra), one can first make use of either of the 
isomorphisms (see above) 

C£° 1 Jorq>0 
PA P,q-1 

C£° 1 ,for/7>0 
p.q q>p-i 

and realize a spin representation in signature (p,q) as a pin representation in either signature (p,q-l) or (q,p-l). 



Applications 
Differential geometry 

One of the principal applications of the exterior algebra is in differential geometry where it is used to define the 
bundle of differential forms on a smooth manifold. In the case of a (pseudo-)Riemannian manifold, the tangent 
spaces come equipped with a natural quadratic form induced by the metric. Thus, one can define a Clifford bundle in 
analogy with the exterior bundle. This has a number of important applications in Riemannian geometry. Perhaps 
more importantly is the link to a spin manifold, its associated spinor bundle and spin c manifolds. 



Physics 

Clifford algebras have numerous important applications in physics. Physicists usually consider a Clifford algebra to 
be an algebra spanned by matrices y^,...,y called Dirac matrices which have the property that 

Hlj + liH = 2r lij 

where r| is the matrix of a quadratic form of signature (p,q) — typically (1,3) when working in Minkowski space. 
These are exactly the defining relations for the Clifford algebra Cl^ 3 (C) (up to an unimportant factor of 2), which by 
the classification of Clifford algebras is isomorphic to the algebra of 4 by 4 complex matrices. 

The Dirac matrices were first written down by Paul Dirac when he was trying to write a relativistic first-order wave 
equation for the electron, and give an explicit isomorphism from the Clifford algebra to the algebra of complex 
matrices. The result was used to define the Dirac equation and introduce the Dirac operator. The entire Clifford 
algebra shows up in quantum field theory in the form of Dirac field bilinears. 



Clifford algebra 



122 



Computer Vision 

Recently, Clifford algebras have been applied in the problem of action recognition and classification in computer 
vision. Rodriguez et al. ^ propose a Clifford embedding to generalize traditional MACH filters to video (3D 
spatiotemporal volume), and vector-valued data such as optical flow. Vector-valued data is analyzed using the 
Clifford Fourier transform. Based on these vectors action filters are synthesized in the Clifford Fourier domain and 
recognition of actions is performed using Clifford Correlation. The authors demonstrate the effectiveness of the 
Clifford embedding by recognizing actions typically performed in classic feature films and sports broadcast 
television. 

Notes 

[1] Mathematicians who work with real Clifford algebras and prefer positive definite quadratic forms (especially those working in index theory) 

2 

sometimes use a different choice of sign in the fundamental Clifford identity. That is, they take v = -Q(y). One must replace Q with -Q in 

going from one convention to the other. 
[2] Thus the group algebra K[Z/2] is semisimple and the Clifford algebra splits into eigenspaces of the main involution. 
[3] We are still assuming that the characteristic is not 2. 

[4] The opposite is true when using the alternate (-) sign convention for Clifford algebras: it is the conjugate which is more important. In general, 
the meanings of conjugation and transpose are interchanged when passing from one sign convention to the other. For example, in the 
convention used here the inverse of a vector is given by y~ ^ = j Q(v) while in the (-) convention it is given by 

v- 1 =v/Q(v). 

[5] Rodriguez, Mikel; Shah, M (2008). "Action MACH: A Spatio-Temporal Maximum Average Correlation Height Filter for Action 
Classification". Computer Vision and Pattern Recognition (CVPR). 
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Distributions 



In mathematical analysis, distributions (or generalized functions) are objects that generalize functions. 
Distributions make it possible to differentiate functions whose derivatives do not exist in the classical sense. In 
particular, any locally integrable function has a distributional derivative. Distributions are widely used to formulate 
generalized solutions of partial differential equations. Where a classical solution may not exist or be very difficult to 
establish, a distribution solution to a differential equation is often much easier. Distributions are also important in 
physics and engineering where many problems naturally lead to differential equations whose solutions or initial 
conditions are distributions, such as the Dirac delta distribution. 

Generalized functions were introduced by Sergei Sobolev in 1935. They were re-introduced in the late 1940s by 
Laurent Schwartz, who developed a comprehensive theory of distributions. 

Basic idea 



A 




-1 1 

A typical test function, the bump function It is smooth (infinitely 
differentiable) and has compact support (is zero outside an interval, in this 
case the interval [-1, 1]). 



Distributions are a class of linear functionals that map a set of test functions (conventional and well-behaved 
functions) onto the set of real numbers. In the simplest case, the set of test functions considered is D(R), which is the 
set of functions from R to R having two properties: 

• The function is smooth (infinitely differentiable); 

• The function has compact support (is identically zero outside some interval). 

Then, a distribution d is a mapping from D(R) to R. Instead of writing d( <fi ), where ^ is a test function in D(R), it 
is conventional to write (d, (fi) . A simple example of a distribution is the Dirac delta 6, defined by 

%>) = {6,<p)=<p(0). 

There are straightforward mappings from both locally integrable functions and probability distributions to 
corresponding distributions, as discussed below. However, not all distributions can be formed in this manner. 

Suppose that 

/:R^R 
is a locally integrable function, and let 

if : R^R 

be a test function in D(R). We can then define a corresponding distribution T^.by: 
(T f ,<p) = [ ftpdx. 

This integral is a real number which linearly and continuously depends on <fi . This suggests the requirement that a 
distribution should be linear and continuous over the space of test functions D(R), which completes the definition. In 
a conventional abuse of notation, / may be used to represent both the original function / and the distribution T^, 
derived from it. 
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Similarly, if P is a probability distribution on the reals and <p is a test function, then a corresponding distribution 
may be defined by: 

(T P ,<p) = [ ipdP 
Jn 

Again, this integral continuously and linearly depends on ^ , so that is in fact a distribution. 

Such distributions may be multiplied with real numbers and can be added together, so they form a real vector space. 
In general it is not possible to define a multiplication for distributions, but distributions may be multiplied with 
infinitely differentiable functions. 

It's desirable to choose a definition for the derivative of a distribution which, at least for distributions derived from 
locally integrable functions, has the property that (T^)' = T . If ^ is a test function, we can show that 

(T f ,,<p)= [ f'<pdx=- [ f<p'dx = -(T f ,<p') 
Jn Jn 

using integration by parts and noting that [f , {^)^p{^)]°^ QO — 0, since <^is zero outside of a bounded set. This 
suggests that if S is a distribution, we should define its derivative S' by 

" (S',<p) = -(S, ( p'). 

It turns out that this is the proper definition; it extends the ordinary definition of derivative, every distribution 
becomes infinitely differentiable and the usual properties of derivatives hold. 

Example: Recall that the Dirac delta (so-called Dirac delta function) is the distribution defined by 

(6, ip) = <p(0) 

It is the derivative of the distribution corresponding to the Heaviside step function H: For any test function ip , 

/oc roc 
H(x)<p'(x)dx = - / <p'(x)dx = p(0)-<p(oo) = <p{0) = (6, if) , 
-oc J0 

so (Tjy )' = S . Note, (f(oo) = 0 because of compact support. Similarly, the derivative of the Dirac delta is the 
distribution 

= V(o). 

This latter distribution is our first example of a distribution which is derived from neither a function nor a probability 
distribution. 

Test functions and distributions 

In the sequel, real-valued distributions on an open subset U of will be formally defined. With minor 
modifications, one can also define complex-valued distributions, and one can replace by any (paracompact) 
smooth manifold. 

The first object to define is the space D(U) of test functions on U. Once this is defined, it is then necessary to equip it 
with a topology by defining the limit of a sequence of elements of D(£/). The space of distributions will then be 
given as the space of continuous linear functionals on D(t/). 

Test function space 

The space D(U) of test functions on U is defined as follows. A function <p : U — » R is said to have compact support 
if there exists a compact subset K of U such that <p(x) = 0 for all x in U \ K. The elements of D(U) are the infinitely 
differentiable functions : U —> R with compact support — also known as bump functions. This is a real vector 
space. It can be given a topology by defining the limit of a sequence of elements of D(t/). A sequence ( ) in 
D(U) is said to converge to <p G D(U) if the following two conditions hold (Gelfand & Shilov 1966-1968, v. 1, 
§1.2): 
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• There is a compact set K C U containing the supports of all <p ^: 

[Jsupp(^ fc ) C K. 
k 

• For each multiindex a, the sequence of partial derivatives D a <p k tends uniformly to D a <fi . 

With this definition, D(U) becomes a complete locally convex topological vector space satisfying the Heine-Borel 
property (Rudin 1991, §6.4-5). If U is a countable nested family of open subsets of U with compact closures Ki=Ui 
, then 

D(U) = \jD Ki 

i 

where D / is the set of all smooth functions with support lying in K.. The topology on D(U) is the final topology of 

K i 

the family of nested metric spaces D^z and so D(t/) is an LF-space. The topology is not metrizable by the Baire 
category theorem, since D(U) is the union of subspaces of the first category in D(U) (Rudin 1991, §6.9). 

Distributions 

A distribution on U is a linear functional S : D(t/) -> R (or S : D(U) -> C), such that 

lim S(tp n ) = S I lim tp n ) 

for any convergent sequence <p ^ in D(U). The space of all distributions on U is denoted by D'(t/). Equivalently, the 
vector space D'(£/) is the continuous dual space of the topological vector space D(t/). 

The dual pairing between a distribution S in D'(£/) and a test function ifi in D(£/) is denoted using angle brackets 
thus: 

D'(U) x B(U) 3 (S, if) h-> (5, if) £ R. 
Equipped with the weak-* topology, the space D\U) is a locally convex topological vector space. In particular, a 
sequence (Sp in D'(t/) converges to a distribution S if and only if 

(S k ,<p) ^ {S,<p) 

for all test functions ip . This is the case if and only if converges uniformly to S on all bounded subsets of D(t/). 
(A subset E of D(U) is bounded if there exists a compact subset K of U and numbers such that every (p in £ has 
its support in and has its ft-th derivatives bounded by d^) 

Functions as distributions 

The function / : U —> R is called locally integrable if it is Lebesgue integrable over every compact subset K of U. 
This is a large class of functions which includes all continuous functions and all if functions. The topology on D(t/) 
is defined in such a fashion that any locally integrable function / yields a continuous linear functional on D([/) — 
that is, an element of D'(U) — denoted here by 7^, whose value on the test function <^is given by the Lebesgue 
integral: 

(T fz ip)= [ ftpdx. 
Ju 

Conventionally, one abuses notation by identifying 7^ with j\ provided no confusion can arise, and thus the pairing 
between / and <p is often written 

(f,<p) = (T f ,<p)- 

If / and g are two locally integrable functions, then the associated distributions 7^ and 7^ are equal to the same 
element of D\U) if and only if / and g are equal almost everywhere (see, for instance, Hormander (1983, Theorem 
1.2.5)). In a similar manner, every Radon measure \x on U defines an element of D\U) whose value on the test 
function ^ is J (fd\i. As above, it is conventional to abuse notation and write the pairing between a Radon 
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measure \x and a test function <fi as (fi,(p) . Conversely, essentially by the Riesz representation theorem, every distribution 
which is non-negative on non-negative functions is of this form for some (positive) Radon measure. 

The test functions are themselves locally integrable, and so define distributions. As such they are dense in D\U) with 
respect to the topology on D'(£/) in the sense that for any distribution S G D'(£/), there is a sequence n ^ D(£/) 
such that 

fa,*) -> (M) 

for all ip ED(U). This follows at once from the Hahn— Banach theorem, since by an elementary fact about weak 
topologies the dual of D'(t/) with its weak-* topology is the space D(U) (Rudin 1991, Theorem 3.10). This can also 
be proven more constructively by a convolution argument. 



Operations on distributions 

Many operations which are defined on smooth functions with compact support can also be defined for distributions. 
In general,,, if 

T : D(U) -> D(U) 

is a linear mapping of vector spaces which is continuous with respect to the weak-* topology, then it is possible to 
extend T to a mapping 

T : D'(U) -> D'(U) 

by passing to the limit. (This approach works for more general non-linear mappings as well, provided they are 
assumed to be uniformly continuous.) 

In practice, however, it is more convenient to define operations on distributions by means of the transpose (or adjoint 
transformation) (Strichartz 1994, §2.3; Treves 1967). If T : D(t/) — » D(£/) is a continuous linear operator, then the 
transpose is an operator T :D(U)—>D(U) such that 

(T<p,ip) = {ip,T*ip) 

for all <p , ip G D(£/). If such an operator T exists, and is continuous, then the original operator T may be extended 
to distributions by defining 

Differentiation 

If T : D(U) — » D(U) is given by the partial derivative 

T„ = |^. 
dx k 

By integration by parts, if f and 1]) are in D(U), then 

so that T = -T. This is a continuous linear transformation D(U) —> D(U). So, if S £ D\U) is a distribution, then the 
partial derivative of S with respect to the coordinate is defined by the formula 




\ dxk I \ dxk J 

for all test functions . In this way, every distribution is infinitely differentiable, and the derivative in the direction 
x k is a linear operator on D'(U). In general, if a = (a^ a ) is an arbitrary multi-index and d a denotes the 
associated mixed partial derivative operator, the mixed partial derivative d a S of the distribution S E D'(£/) is defined 
by 

{d"S, if) = (-l)W {S, &*ip) for aU <p G D(C7). 
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Differentiation of distributions is a continuous operator on U(U); this is an important and desirable property that is 
not shared by most other notions of differentiation. 

Multiplication by a smooth function 

If m : U — > R is an infinitely differentiable function and S is a distribution on U, then the product mS is defined by 
(mS)( ( fi) = S(m ) for all test functions tp . This definition coincides with the transpose transformation of 

T m : if h-> rmp 
for (p GD(U). Then, for any test function ip 

(T m ip,i>) = f m(x)<p(x)i>(x)dx = (ip,T m i>) 
J u 

so that T =T . Multiplication of a distribution S by the smooth function m is therefore defined by 

mm 

mSfy) = (mS. ip) = (£, rmj)) = S(mip). 
Under multiplication by smooth functions, D\U) is a module over the ring C°°(D). With this definition of 
multiplication by a smooth function, the ordinary product rule of calculus remains valid. However, a number of 
unusual identities also arise. For example, the Dirac delta distribution 6 is defined on R by (6, tp) = *P (OX an d 
its derivative is given by (6', <p ) =- (S, <fi') =- *P '(0)- However, the product m6' is the distribution 

mS' = m(0)5' - m!8. 

This definition of multiplication also makes it possible to define the operation of a linear differential operator with 
smooth coefficients on a distribution. A linear differential operator takes a distribution S G T>\U) to another 
distribution given by a sum of the form 

ps = p^ s 

\a\<k 

where the coefficients are smooth functions in U. If P is a given differential operator, then the minimum integer k 
for which such an expansion holds for every distribution S is called the order of P. The transpose of P is given by 

The space D\U) is a D-module with respect to the action of the ring of linear differential operators. 

Composition with a smooth function 

Let S be a distribution on an open set U C R^. Let V be an open set in R n , and F : V —> U. Then provided F is a 
submersion, it is possible to define 

SoF£D'(V). 

This is the composition of the distribution S with F, and is also called the pullback of S along F, sometimes written 
F* : S i-> F*S = S o F. 

The pullback is often denoted F , but this notation risks confusion with the above use of '*' to denote the transpose of 
a linear mapping. 

The condition that F be a submersion is equivalent to the requirement that the Jacobian derivative dF(x) of F is a 
surjective linear map for every x E V. A necessary (but not sufficient) condition for extending F # to distributions is 
that F be an open mapping (Hormander 1983, Theorem 6.1.1). The inverse function theorem ensures that a 
submersion satisfies this condition. 

If F is a submersion, then F # is defined on distributions by finding the transpose map. Uniqueness of this extension is 
guaranteed since F # is a continuous linear operator on D(£/). Existence, however, requires using the change of 
variables formula, the inverse function theorem (locally) and a partition of unity argument; see Hormander (1983, 
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Theorem 6.1.2). 

In the special case when F is a diffeomorphism from an open subset V of onto an open subset U of change of 
variables under the integral gives 

J ip o F{x)ifj{x) dx = J if{x)^{F- l {x))\detdF- 1 {x)\dx. 
In this particular case, then, F is defined by the transpose formula: 
{F^S.if) = (S^detdiF-^lipoF- 1 ). 

Localization of distributions 

There is no way to define the value of a distribution in D\U) at a particular point of U. However, as is the case with 
functions, distributions on U restrict to give distributions on open subsets of U. Furthermore, distributions are locally 
determined in the sense that a distribution on all of U can be assembled from a distribution on an open cover of U 
satisfying some compatibility conditions on the overlap. Such a structure is known as a sheaf. 

Restriction 

Let U and V be open subsets of R n with VCU. Let E : D(V) —>D(U) be the operator which extends by zero a 
given smooth function compactly supported in V to a smooth function compactly supported in the larger set U. Then 
the restriction mapping p ^is defined to be the transpose of E yu . Thus for any distribution S G D\U), the restriction 
p S is a distribution in the dual space D'( V) defined by 

(pvuS, if) = (S, E VU ip) 
for all test functions ip G D(V). 

Unless U=V, the restriction to V is neither injective nor surjective. Lack of surjectivity follows since distributions 
can blow up towards the boundary of V. For instance, if U = R and V = (0,2), then the distribution 

S(x) = f^nS(x-^j 

is in D'(V) but admits no extension to D\U). 
Support of a distribution 

Let S £ D'(U) be a distribution on an open set U. Then S is said to vanish on an open set V of U if S lies in the kernel 
of the restriction map p^. Explicitly S vanishes on V if 

(5, = 0 

for all test functions G C°°(U) with support in V. Let V be a maximal open set on which the distribution S 
vanishes; i.e., Vis the union of every open set on which S vanishes. The support of S is the complement of V in U. 
Thus 

su PP 5 = C/-|J{^|^ [ /5 = 0}. 

The distribution S has compact support if its support is a compact set. Explicitly, S has compact support if there is a 
compact subset K of U such that for every test function <p whose support is completely outside of K, we have S( <p 
) = 0. Compactly supported distributions define continuous linear functions on the space C°°(U) m , the topology on 
C°°(U) is defined such that a sequence of test functions <fi k converges to 0 if and only if all derivatives of ¥ k 
converge uniformly to 0 on every compact subset of U. Conversely, it can be shown that every continuous linear 
functional on this space defines a distribution of compact support. 
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Tempered distributions and Fourier transform 

By using a larger space of test functions, one can define the tempered distributions, a subspace of D'(R W ). These 
distributions are useful if one studies the Fourier transform in generality: all tempered distributions have a Fourier 
transform, but not all distributions have one. 

The space of test functions employed here, the so-called Schwartz space S(R n ), is the function space of all infinitely 
differentiable functions that are rapidly decreasing at infinity along with all partial derivatives. Thus <fi : — » R is 
in the Schwartz space provided that any derivative of <p , multiplied with any power of bcl, converges towards 0 for 
\x\—>oo. These functions form a complete topological vector space with a suitably defined family of seminorms. 
More precisely, let 

p atP (<p) = sup \x a D^(x)\ 

for a, (3 multi-indices of size n. Then <fi is a Schwartz function if all the values 

p a A<p) < °°- 

The family of seminorms ^ defines a locally convex topology on the Schwartz- space. The seminorms are, in fact, 
norms on the Schwartz space, since Schwartz functions are smooth. The Schwartz space is metrizable and complete. 

The space of tempered distributions is defined as the (continuous) dual of the Schwartz space. In other words, a 
distribution F is a tempered distribution if and only if 

lim F(<p m ) = 0. 

is true whenever, 

holds for all multi-indices a, (3. 

The derivative of a tempered distribution is again a tempered distribution. Tempered distributions generalize the 
bounded (or slow-growing) locally integrable functions; all distributions with compact support and all 
square-integrable functions are tempered distributions. All locally integrable functions / with at most polynomial 
growth, i.e. such that/(x) = 0(lxl r ) for some r, are tempered distributions. This includes all functions in L p (R n ) for 

P >\. 

The tempered distributions can also be characterized as slowly growing. This characterization is dual to the rapidly 
falling behaviour, e.g. oc • exp(— x 2 ) , of the test functions. 

To study the Fourier transform, it is best to consider complex-valued test functions and complex-linear distributions. 
The ordinary continuous Fourier transform F yields then an automorphism of Schwartz function space, and we can 
define the Fourier transform of the tempered distribution S by (FS)(\|0 = S(Fty) for every test function ip. FS is 
thus again a tempered distribution. The Fourier transform is a continuous, linear, bijective operator from the space of 
tempered distributions to itself. This operation is compatible with differentiation in the sense that 

dS . 
F— = ixFS 
dx 

and also with convolution: if S is a tempered distribution and ip is a slowly increasing infinitely differentiable 
function on R^ (meaning that all derivatives of grow at most as fast as polynomials), then Sty is again a tempered 
distribution and 

F(SV) = FS*FiP 

is the convolution of FS and Fty. 
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Convolution 

Under some circumstances, it is possible to define the convolution of a function with a distribution, or even the 
convolution of two distributions. 

Convolution of a test function with a distribution 

If/G D(R W ) is a compactly supported smooth test function, then convolution with/defines an operator 

C f : D(R n ) -> D(R n ) 

defined by =f*g, which is linear (and continuous with respect to the LF space topology on D(R W ).) 

Convolution of / with a distribution S £ D'(R W ) can be defined by taking the transpose of relative to the duality 
pairing of D(R n ) with the space D'(R n ) of distributions (Treves 1967, Chapter 27). If/, g, if G D(R n ), then by 
Fubini's theorem 

(C f g, <p)= <p{x) / f(x - y)g(y) dydx = {g, Cpp) 
where f(x)=f(-x) . Extending by continuity, the convolution off with a distribution S is defined by 

(f*S,<p) = {Sj*<p) 

for all test functions <p G D(R"). 

An alternative way to define the convolution of a function /and a distribution S is to use the translation operator 
defined on test functions by 

r x tp(y) = tp(y - x) 

and extended by the transpose to distributions in the obvious way (Rudin 1991, §6.29). The convolution of the 
compactly supported function /and the distribution S is then the function defined for each x G R n by 

(f*S)(x) = {S,rJ). 

It can be shown that the convolution of a compactly supported function and a distribution is a smooth function. If the 
distribution S has compact support as well, then f*S is a compactly supported function, and the Titchmarsh 
convolution theorem (Hormander 1983, Theorem 4.3.3) implies that 

ch(/ * S) = chf + chS 

where ch denotes the convex hull. 
Distribution of compact support 

It is also possible to define the convolution of two distributions S and T on R^, provided one of them has compact 
support. Informally, in order to define S*T where T has compact support, the idea is to extend the definition of the 
convolution * to a linear operation on distributions so that the associativity formula 

S * (T * (p) = (S *T) * (f 
continues to hold for all test-functions ip . Hormander (1983, §IV.2) proves the uniqueness of such an extension. 

It is also possible to provide a more explicit characterization of the convolution of distributions (Treves 1967, 
Chapter 27). Suppose that it is Tthat has compact support. For any test function <fi in D(R W ), consider the function 

1>(x) = {T,T- x <p). 

It can be readily shown that this defines a smooth function of x, which moreover has compact support. The 
convolution of S and T is defined by 

(S*T,<p) = (S,iP). 

This generalizes the classical notion of convolution of functions and is compatible with differentiation in the 
following sense: 

&*{S *T) = (&*S) *T = S* (d a T). 



Distributions 



131 



This definition of convolution remains valid under less restrictive assumptions about S and T\ see for instance 
Gel'fand & Shilov (1966-1968, v. 1, pp. 103-104) and Benedetto (1997, Definition 2.5.8). 

Distributions as derivatives of continuous functions 

The formal definition of distributions exhibits them as a subspace of a very large space, namely the algebraic dual of 
D([/) (or S(R^) for tempered distributions). It is not immediately clear from the definition how exotic a distribution 
might be. To answer this question, it is instructive to see distributions built up from a smaller space, namely the 
space of continuous functions. Roughly, any distribution is locally a (multiple) derivative of a continuous function. A 
precise version of this result, given below, holds for distributions of compact support, tempered distributions, and 
general distributions. Generally speaking, no proper subset of the space of distributions contains all continuous 
functions and is closed under differentiation. This says that distributions are not particularly exotic objects; they are 
only as complicated as necessary. 

Tempered distributions 

If / G S'(R n ) is a tempered distribution, then there exists a constant C > 0, and positive integers M and N such that for 
all Schwartz functions <f eS(R n ) 

(fM<c £ su p = c E p«A<p)- 

\ct\<N,\p\<M xeUn \<x\<N : \P\<M 
This estimate along with some techniques from functional analysis can be used to show that there is a continuous 
slowly increasing function F and a multiindex a such that 

/ = D a F. 

Compactly supported distributions 

Let U be an open set, and K a compact subset of U. If / is a distribution supported on K, then there is a continuous 
function F compactly supported in U (possibly on a larger set than K itself) such that 

f = D a F 

for some multi-index a. This follows from the previously quoted result on tempered distributions by means of a 
localization argument. 

Distributions with point support 

If / has support at a single point {P}, then / is in fact a finite linear combination of distributional derivatives of the 6 
function at P. That is, there exists an integer m and complex constants a for multi indices loci < m such that 

/= £ o, a D a {r P 5) 

|c*|<m 

where x p is the translation operator. 
General distributions 

A version of the above theorem holds locally in the following sense (Rudin 1991). Let S be a distribution on U. Then 
one can find for every multi-index a a continuous function g^ such that 

S = J2D a 9a 

a 

and that any compact subset K of U intersects the supports of only finitely many g^; therefore, to evaluate the value 
of S for a given smooth function / compactly supported in U, we only need finitely many g^ hence the infinite sum 
above is well-defined as a distribution. If the distribution S is of finite order, then one can choose g^ in such a way 
that only finitely many of them are nonzero. 
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Using holomorphic functions as test functions 

The success of the theory led to investigation of the idea of hyperf unction, in which spaces of holomorphic functions 
are used as test functions. A refined theory has been developed, in particular Mikio Sato's algebraic analysis, using 
sheaf theory and several complex variables. This extends the range of symbolic methods that can be made into 
rigorous mathematics, for example Feynman integrals. 



Problem of multiplication 

A possible limitation of the theory of distributions (and hyperf unctions) is that it is a purely linear theory, in the 
sense that the product of two distributions cannot consistently be defined (in general), as has been proved by Laurent 
Schwartz in the 1950s. For example, if p.v. l/x is the distribution obtained by the Cauchy principal value 

(p.v±)[<t>}= lim / ^-dx 
for all <fi E S(R), and 6 is the Dirac delta distribution then 

(5 X x) X p.v.- = 0 
x 

but 

S X (^x X p.v.-^j = S 

so the product of a distribution by a smooth function (which is always well defined) cannot be extended to an 
associative product on the space of distributions. 

Thus, nonlinear problems cannot be posed in general and thus not solved within distribution theory alone. In the 
context of quantum field theory, however, solutions can be found. In more than two spacetime dimensions the 
problem is related to the regularization of divergences. Here Henri Epstein and Vladimir Glaser developed the 
mathematically rigorous (but extremely technical) causal perturbation theory. This does not solve the problem in 
other situations. Many other interesting theories are non linear, like for example Navier-Stokes equations of fluid 
dynamics. 

In view of this, several not entirely satisfactory theories of algebras of generalized functions have been developed, 
among which Colombeau's (simplified) algebra is maybe the most popular in use today. 

A simple solution of the multiplication problem is dictated by the path integral formulation of quantum mechanics. 
Since this is required to be equivalent to the Schrodinger theory of quantum mechanics which is invariant under 
coordinate transformations, this property must be shared by path integrals. This fixes all products of distributions as 
shown by Kleinert & Chervyakov (2001) The result is equivalent to what can be derived from dimensional 
regularization (Kleinert & Chervyakov 2000). 
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Hilbert space 




Hilbert spaces can be used to study the harmonics 
of vibrating strings. 



The mathematical concept of a Hilbert space, named after David 
Hilbert, generalizes the notion of Euclidean space. It extends the 
methods of vector algebra and calculus from the two-dimensional 
Euclidean plane and three-dimensional space to spaces with any finite 
or infinite number of dimensions. A Hilbert space is an abstract vector 
space possessing the structure of an inner product that allows length 
and angle to be measured. Furthermore, Hilbert spaces are required to 
be complete, a property that stipulates the existence of enough limits in 
the space to allow the techniques of calculus to be used. 

Hilbert spaces arise naturally and frequently in mathematics, physics, 
and engineering, typically as infinite-dimensional function spaces. The 
earliest Hilbert spaces were studied from this point of view in the first 
decade of the 20th century by David Hilbert, Erhard Schmidt, and 
Frigyes Riesz. They are indispensable tools in the theories of partial differential equations, quantum mechanics, 
Fourier analysis (which includes applications to signal processing and heat transfer) and ergodic theory which forms 
the mathematical underpinning of the study of thermodynamics. John von Neumann coined the term "Hilbert space" 
for the abstract concept underlying many of these diverse applications. The success of Hilbert space methods ushered 
in a very fruitful era for functional analysis. Apart from the classical Euclidean spaces, examples of Hilbert spaces 
include spaces of square-integrable functions, spaces of sequences, Sobolev spaces consisting of generalized 
functions, and Hardy spaces of holomorphic functions. 

Geometric intuition plays an important role in many aspects of Hilbert space theory. Exact analogs of the 
Pythagorean theorem and parallelogram law hold in a Hilbert space. At a deeper level, perpendicular projection onto 
a subspace (the analog of "dropping the altitude" of a triangle) plays a significant role in optimization problems and 
other aspects of the theory. An element of a Hilbert space can be uniquely specified by its coordinates with respect to 
a set of coordinate axes (an orthonormal basis), in analogy with Cartesian coordinates in the plane. When that set of 
axes is countably infinite, this means that the Hilbert space can also usefully be thought of in terms of infinite 
sequences that are square- summable. Linear operators on a Hilbert space are likewise fairly concrete objects: in good 
cases, they are simply transformations that stretch the space by different factors in mutually perpendicular directions 
in a sense that is made precise by the study of their spectral theory. 



Definition and illustration 

Motivating example: Euclidean space 

One of the most familiar examples of a Hilbert space is the Euclidean space consisting of three-dimensional vectors, 

3 

denoted by R , and equipped with the dot product. The dot product takes two vectors x and y, and produces a real 
number xy. If x and y are represented in Cartesian coordinates, then the dot product is defined by 

(x u x 2 , x 3 ) • (y u y 2) jft) = rcij/i + x 2 y 2 + x 3 y 3 . 
The dot product satisfies the properties: 

1. It is symmetric in x and y: x y = y x. 

2. It is linear in its first argument: (ax^ + bx^-y = ax^-y + bx^y for any scalars a, b, and vectors x^ x^ and y. 

3. It is positive definite: for all vectors x, x x > 0 with equality if and only if x = 0. 
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An operation on pairs of vectors that, like the dot product, satisfies these three properties is known as a (real) inner 
product. A vector space equipped with such an inner product is known as a (real) inner product space. Every 
finite-dimensional inner product space is also a Hilbert space. The basic feature of the dot product that connects it 
with Euclidean geometry is that it is related to both the length (or norm) of a vector, denoted llxll, and to the angle 9 
between two vectors x and y by means of the formula 

x • y = ||x|| ||y|| cos0. 
Multivariable calculus in Euclidean space relies on the ability to 
compute limits, and to have useful criteria for concluding that limits 
exist. A mathematical series 




Completeness means that if a particle moves 
along the broken path (in blue) travelling a finite 
total distance, then the particle has a well-defined 
net displacement (in orange). 



oc 

71=0 



3 

consisting of vectors in R is absolutely convergent provided that the sum of the lengths converges as an ordinary 
series of real numbers: ^ 

5^||x fc || < oo. 

fc=0 

Just as with a series of scalars, a series of vectors that converges absolutely also converges to some limit vector L in 
the Euclidean space, in the sense that 



N 



0 as N — > oo. 



This property expresses the completeness of Euclidean space: that a series which converges absolutely also 
converges in the ordinary sense. 



Definition 

A Hilbert space H is a real or complex inner product space that is also a complete metric space with respect to the 

T21 

distance function induced by the inner product. To say that H is a complex inner product space means that H is a 
complex vector space on which there is an inner product (x,y) associating a complex number to each pair of elements 
x,y of H that satisfies the following properties: 

• (y,x) is the complex conjugate of (x,y)\ 

(y,x) = (x : y). 

ran 

• (x,y) is linear in its first argument. For all complex numbers a and b, 

(ax 1 + bx 2 ,y) = a{x u y) + b(x 2 ,y). 

• The inner product is positive definite: 

(x,x) > 0 

where the case of equality holds precisely when x = 0. 
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It follows from properties 1 and 2 that a complex inner product is antilinear in its second argument, meaning that 

(x,ay 1 + by 2 ) = ofoyi) + b{x,y 2 ). 
A real inner product space is defined in the same way, except that H is a real vector space and the inner product takes 
real values. Such an inner product will be bilinear: that is, linear in each argument. 

The norm defined by the inner product (•,•) is the real- valued function 
\\ x \\ = \/( x , x )i 

and the distance between two points x,y in H is defined in terms of the norm by 

d(x, y) = \\x - y || = ^ {x - y, x - y). 
That this function is a distance function means (1) that it is symmetric in x and y, (2) that the distance between x and 
itself is zero, and otherwise the distance between x and y must be positive, and (3) that the triangle inequality holds, 
meaning that the length of one leg of a triangle xyz cannot exceed the sum of the lengths of the other two legs: 

d(x,z) < d(x,y) + d(y 9 z). 







\d(y.z) 


x u - 













This last property is ultimately a consequence of the more fundamental Cauchy-Schwarz inequality, which asserts 

\{x,y)\ < Ml ||y|| 

with equality if and only if x and y are linearly dependent. 

Relative to a distance function defined in this way, any inner product space is a metric space, and sometimes is 
known as a pre-Hilbert space A pre-Hilbert space is a Hilbert space if in addition it is complete. Completeness is 
expressed using a form of the Cauchy criterion for sequences in H: a pre-Hilbert space H is complete if every 
Cauchy sequence converges with respect to this norm to an element in the space. Completeness can be characterized 
by the following equivalent condition: if a series of vectors YlkLo u k converges absolutely in the sense that 

oc 

]T \\u k \\ < OO, 

then the series converges in H, in the sense that the partial sums converge to an element of H. 

As a complete normed space, Hilbert spaces are by definition also Banach spaces. As such they are topological 
vector spaces, in which topological notions like the openness and closedness of subsets are well-defined. Of special 
importance is the notion of a closed linear subspace of a Hilbert space which, with the inner product induced by 
restriction, is also complete (being a closed set in a complete metric space) and therefore a Hilbert space in its own 
right. 
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Second example: sequence spaces 

2 

The sequence space D consists of all infinite sequences z = (z^,...) of complex numbers such that the series 

oo 

71=1 

2 

converges. The inner product on D is defined by 

oo 

71=1 

with the latter series converging as a consequence of the Cauchy-Schwarz inequality. 

2 

Completeness of the space holds provided that whenever a series of elements from D converges absolutely (in norm), 

2 

then it converges to an element of D . The proof is basic in mathematical analysis, and permits mathematical series of 
elements of the space to be manipulated with the same ease as series of complex numbers (or vectors in a 
finite-dimensional Euclidean space) 



History 

Prior to the development of Hilbert spaces, other generalizations of 

Euclidean spaces were known to mathematicians and physicists. In 

particular, the idea of an abstract linear space had gained some traction 

towards the end of the 19th century -J® this is a space whose elements 

can be added together and multiplied by scalars (such as real or 

complex numbers) without necessarily identifying these elements with 

"geometric" vectors, such as position and momentum vectors in 

physical systems. Other objects studied by mathematicians at the turn 

of the 20th century, in particular spaces of sequences (including series) 
T71 

and spaces of functions, can naturally be thought of as linear spaces. 
Functions, for instance, can be added together or multiplied by 
constant scalars, and these operations obey the algebraic laws satisfied 
by addition and scalar multiplication of spatial vectors. 

In the first decade of the 20th century, parallel developments led to the 

introduction of Hilbert spaces. The first of these was the observation, 

which arose during David Hilbert and Erhard Schmidt's study of 
rsi 

integral equations, that two square-integrable real-valued functions/ 
and g on an interval [a,b] have an inner product 

(f,9) = / f(x)g(x)dx 

J a 

which has many of the familiar properties of the Euclidean dot product. In particular, the idea of an orthogonal 
family of functions has meaning. Schmidt exploited the similarity of this inner product with the usual dot product to 
prove an analog of the spectral decomposition for an operator of the form 

f b 

f(x) h-> / K(x,y)f(y)dy 

J a 

where K is a continuous function symmetric in x and y. The resulting eigenfunction expansion expresses the function 
K as a series of the form 

71 
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where the functions w are orthogonal in the sense that (w ,w ) = 0 for all n ± m. The individual terms in this 

n n m 

series are sometimes referred to as elementary product solutions. However, there are eigenfunction expansions which 
fail to converge in a suitable sense to a square-integrable function: the missing ingredient, which ensures 
convergence, is completeness. 1 

The second development was the Lebesgue integral, an alternative to the Riemann integral introduced by Henri 
Lebesgue in 1904.^ The Lebesgue integral made it possible to integrate a much broader class of functions. In 1907, 

2 

Frigyes Riesz and Ernst Sigismund Fischer independently proved that the space L of square Lebesgue-integrable 
functions is a complete metric spaced 1 ^ As a consequence of the interplay between geometry and completeness, the 
19th century results of Joseph Fourier, Friedrich Bessel and Marc-Antoine Parseval on trigonometric series easily 
carried over to these more general spaces, resulting in a geometrical and analytical apparatus now usually known as 
the Riesz-Fischer theorem J 12 ^ 

Further basic results were proved in the early 20th century. For example, the Riesz representation theorem was 

ri3i 

independently established by Maurice Frechet and Frigyes Riesz in 1907. John von Neumann coined the term 

abstract Hilbert space in his work on unbounded Hermitian operators J 14 ^ Although other mathematicians such as 

Hermann Weyl and Norbert Wiener had already studied particular Hilbert spaces in great detail, often from a 

physically-motivated point of view, von Neumann gave the first complete and axiomatic treatment of themJ 15] Von 

Neumann later used them in his seminal work on the foundations of quantum mechanics, ^ 16 ^ and in his continued 

work with Eugene Wigner. The name "Hilbert space" was soon adopted by others, for example by Hermann Weyl in 

ri7i 

his book on quantum mechanics and the theory of groups. 

The significance of the concept of a Hilbert space was underlined with the realization that it offers one of the best 

ri8i 

mathematical formulations of quantum mechanics. In short, the states of a quantum mechanical system are 

vectors in a certain Hilbert space, the observables are hermitian operators on that space, the symmetries of the 

system are unitary operators, and measurements are orthogonal projections. The relation between quantum 

mechanical symmetries and unitary operators provided an impetus for the development of the unitary representation 

ri7i 

theory of groups, initiated in the 1928 work of Hermann Weyl. On the other hand, in the early 1930s it became 
clear that certain properties of classical dynamical systems can be analyzed using Hilbert space techniques in the 
framework of ergodic theory J 

The algebra of observables in quantum mechanics is naturally an algebra of operators defined on a Hilbert space, 
according to Werner Heisenberg's matrix mechanics formulation of quantum theory. Von Neumann began 
investigating operator algebras in the 1930s, as rings of operators on a Hilbert space. The kind of algebras studied by 
von Neumann and his contemporaries are now known as von Neumann algebras. In the 1940s, Israel Gelfand, Mark 
Naimark and Irving Segal gave a definition of a kind of operator algebras called C*-algebras that on the one hand 
made no reference to an underlying Hilbert space, and on the other extrapolated many of the useful features of the 
operator algebras that had previously been studied. The spectral theorem for self-adjoint operators in particular that 
underlies much of the existing Hilbert space theory was generalized to C*-algebras. These techniques are now basic 
in abstract harmonic analysis and representation theory. 
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Examples 

Lebesgue spaces 

Lebesgue spaces are function spaces associated to measure spaces (X, M, fi), where X is a set, M is a a-algebra of 
subsets of X, and fi is a countably additive measure on M. Let L (X,\x) be the space of those complex- valued 
measurable functions on X for which the Lebesgue integral of the square of the absolute value of the function is 
finite, i.e., for a function fin L 2 (X,\i), 

\f\ 2 dfJ. < 00, 

and where functions are identified if and only if they differ only on a set of measure zero. 

2 

The inner product of functions /and g in L (X,\x) is then defined as 

(/,<?>= / f{tW)M*)- 

J X 

2 

For /and g in L , this integral exists because of the Cauchy-Schwarz inequality, and defines an inner product on the 

2 r20i 

space. Equipped with this inner product, L is in fact complete. The Lebesgue integral is essential to ensure 

T211 

completeness: on domains of real numbers, for instance, not enough functions are Riemann integrable. 

2 2 

The Lebesgue spaces appear in many natural settings. The spaces L (R) and L ([0,1]) of square-integrable functions 
with respect to the Lebesgue measure on the real line and unit interval, respectively, are natural domains on which to 
define the Fourier transform and Fourier series. In other situations, the measure may be something other than the 
ordinary Lebesgue measure on the real line. For instance, if w is any positive measurable function, the space of all 
measurable functions /on the interval [0,1] satisfying 

f 1 \f(t)\ 2 w(t) dt < oo 
Jo 

is called the weighted L 2 space L % „ ([0,1]), and w is called the weight function. The inner product is defined by 

</,<?)= f f(t)W)w{t)dt. 
Jo 

The weighted space L ^,,([0,1]) is identical with the Hilbert space L 2 ([0,l],|i) where the measure \x of a 
Lebesgue-measurable set A is defined by 

fi(A) = [ w(t)dt. 

J A 

2 

Weighted L spaces like this are frequently used to study orthogonal polynomials, because different families of 
orthogonal polynomials are orthogonal with respect to different weighting functions. 



Sobolev spaces 

s s 2 

Sobolev spaces, denoted by H or W ' , are Hilbert spaces. These are a special kind of function space in which 

differentiation may be performed, but which (unlike other Banach spaces such as the Holder spaces) support the 

structure of an inner product. Because differentiation is permitted, Sobolev spaces are a convenient setting for the 

T221 

theory of partial differential equations. They also form the basis of the theory of direct methods in the calculus of 
variations/ 23 ^ 

For s a non-negative integer and £2 C R^, the Sobolev space ffid) contains L 2 functions whose weak derivatives of 
order up to s are also L 2 . The inner product in 7/ s (£2) is 

</,<?}= / f(x)g(x)dx+ f Df-Dg(x)dx + ...+ f D s f(x) ■ D s g{x)dx 
Jo, Jn Jn 

where the dot indicates the dot product in the Euclidean space of partial derivatives of each order. Sobolev spaces 
can also be defined when s is not an integer. 
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Sobolev spaces are also studied from the point of view of spectral theory, relying more specifically on the Hilbert 
space structure. If £1 is a suitable domain, then one can define the Sobolev space H s (£l) as the space of Bessel 
potentials;^ roughly, 

F(n) = {(l-A)- s / 2 /|/a 2 (n)}. 

—s/2 

Here A is the Laplacian and (1 - A) is understood in terms of the spectral mapping theorem. Apart from 
providing a workable definition of Sobolev spaces for non-integer s, this definition also has particularly desirable 
properties under the Fourier transform that make it ideal for the study of pseudodifferential operators. Using these 
methods on a compact Riemannian manifold, one can obtain for instance the Hodge decomposition which is the 
basis of Hodge theory. 1 

Spaces of holomorphic functions 

Hardy spaces 

The Hardy spaces are function spaces, arising in complex analysis and harmonic analysis, whose elements are 

certain holomorphic functions in a complex domain J 26 -' Let U denote the unit disc in the complex plane. Then the 

2 

Hardy space H (U) is defined to be the space of holomorphic functions /on U such that the means 



1 r 2lT 

M r(f) = ^J 0 \f(re i6 )\ 2 de 



remain bounded for r < 1 . The norm on this Hardy space is defined by 



a = lim yjMriJ). 

es in the disc a 
oc 

f(z) = £ a„z" 



2 

Hardy spaces in the disc are related to Fourier series. A function /is in H (U) if and only if 



71=0 

where 



£ M 2 < oo. 



71=0 

2 2 
Thus H (U) consists of those functions which are L on the circle, and whose negative frequency Fourier coefficients 

vanish. 

Bergman spaces 

[27] 

The Bergman spaces are another family of Hilbert spaces of holomorphic functions. Let D be a bounded open set 

2 h 

in the complex plane (or a higher dimensional complex space) and let L ' (D) be the space of holomorphic functions 

2 

/in D that are also in L (D) in the sense that 



= / |/(*)| 2 d/x(z)<oo, 

JD 



2 h 2 

where the integral is taken with respect to the Lebesgue measure in D. Clearly L ' (D) is a subspace of L {D)\ in fact, 
it is a closed subspace, and so a Hilbert space in its own right. This is a consequence of the estimate, valid on 
compact subsets K of D, that 

Biipl/tol^Cjcll/Ha, 
zeK 

which in turn follows from Cauchy's integral formula. Thus convergence of a sequence of holomorphic functions in 

2 

L (D) implies also compact convergence, and so the limit function is also holomorphic. Another consequence of this 

2 h 

inequality is that the linear functional that evaluates a function /at a point of D is actually continuous on L ' (D). 

2 h 

The Riesz representation theorem implies that the evaluation functional can be represented as an element of L ' (D). 

2 h 

Thus, for every z £ D, there is a function ij^EL' (D) such that 
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M= [ /(Cto.(C)4rtC) 

Jd 

for all/E L 2,h (D). The integrand 

* (C, *) = £G5 

is known as the Bergman kernel of Z). This integral kernel satisfies a reproducing property 

/(*)= / /(c)Jf(C,*)^(0- 

JD 

A Bergman space is an example of a reproducing kernel Hilbert space, which is a Hilbert space of functions along 

2 

with a kernel K(C,,z) that verifies a reproducing property analogous to this one. The Hardy space H (D) also admits a 
reproducing kernel, known as the Szego kernel Reproducing kernels are common in other areas of mathematics 
as well. For instance, in harmonic analysis the Poisson kernel is a reproducing kernel for the Hilbert space of 
square-integrable harmonic functions in the unit ball. That the latter is a Hilbert space at all is a consequence of the 
mean value theorem for harmonic functions. 

Applications 

Many of the applications of Hilbert spaces exploit the fact that Hilbert spaces support generalizations of simple 
geometric concepts like projection and change of basis from their usual finite dimensional setting. In particular, the 
spectral theory of continuous self-adjoint linear operators on a Hilbert space generalizes the usual spectral 
decomposition of a matrix, and this often plays a major role in applications of the theory to other areas of 
mathematics and physics. 

Sturm-Liouville theory 

In the theory of ordinary differential equations, spectral methods on a 
suitable Hilbert space are used to study the behavior of eigenvalues and 
eigenfunctions of differential equations. For example, the 
Sturm-Liouville problem arises in the study of the harmonics of waves 
in a violin string or a drum, and is a central problem in ordinary 
differential equations. The problem is a differential equation of the 
form 




The overtones of a vibrating string. These are 
eigenfunctions of an associated Sturm-Liouville 
problem. The eigenvalues 1,1/2,1/3,... form the 
(musical) harmonic series. 



d 

dx 



i \ d y 



+ q(x)y = Xw(x)y 



for an unknown function y on an interval [a,b], satisfying general homogeneous Robin boundary conditions 

(ay(a) + a'y'(a) = 0 
\{3y(b) + f3'y'(b) = 0. 

The functions p, q, and w are given in advance, and the problem is to find the function y and constants X for which 
the equation has a solution. The problem only has solutions for certain values of X, called eigenvalues of the system, 
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and this is a consequence of the spectral theorem for compact operators applied to the integral operator defined by 
the Green's function for the system. Furthermore, another consequence of this general result is that the eigenvalues X 
of the system can be arranged in an increasing sequence tending to infinity P 0 ^ 

Partial differential equations 

[221 

Hilbert spaces form a basic tool in the study of partial differential equations. For many classes of partial 

differential equations, such as linear elliptic equations, it is possible to consider a generalized solution (known as a 

weak solution) by enlarging the class of functions. Many weak formulations involve the class of Sobolev functions, 

which is a Hilbert space. A suitable weak formulation reduces to a geometrical problem the analytic problem of 

finding a solution or, often what is more important, showing that a solution exists and is unique for given boundary 

data. For linear elliptic equations, one geometrical result that ensures unique solvability for a large class of problems 

is the Lax-Milgram theorem. This strategy forms the rudiment of the Galerkin method (a finite element method) for 

T311 

numerical solution of partial differential equations. 

2 

A typical example is the Poisson equation -Au = g with Dirichlet boundary conditions in a bounded domain Q in R . 
The weak formulation consists of finding a function u such that, for all continuously differentiable functions v in £1 
vanishing on the boundary: 



f Vu-Vv= [ 
Jn Jn 



gv. 



This can be recast in terms of the Hilbert space H consisting of functions u such that u, along with its weak 

partial derivatives, are square integrable on Q, and which vanish on the boundary. The question then reduces to 
finding u in this space such that for all v in this space 
a(it, v) = b(v) 

where a is a continuous bilinear form, and b is a continuous linear functional, given respectively by 



a(u, v) = I Vu • Vv, b(v) = / gv. 



Since the Poisson equation is elliptic, it follows from Poincare's inequality that the bilinear form a is coercive. The 
Lax-Milgram theorem then ensures the existence and uniqueness of solutions of this equation. 

Hilbert spaces allow for many elliptic partial differential equations to be formulated in a similar way, and the 
Lax-Milgram theorem is then a basic tool in their analysis. With suitable modifications, similar techniques can be 
applied to parabolic partial differential equations and certain hyperbolic partial differential equations. 



Ergodic theory 




The field of ergodic theory is the study of the long-term behavior of 
chaotic dynamical systems. The protypical case of a field to which 
ergodic theory is applicable is that of thermodynamics in which, 
although the microscopic state of a system is extremely 
complicated — it is impossible to understand the ensemble of individual 
collisions between particles of matter — the average behavior over 
sufficiently long time intervals is tractable. The laws of 
thermodynamics are assertions about such average behavior. In 
particular, one formulation of the zeroth law of thermodynamics 
asserts that over sufficiently long timescales, the only functionally 

independent measurement that one can make of a thermodynamic system in equilibrium is its total energy, in the 
form of temperature. 



The path of a billiard ball in the Bunimovich 
stadium is described by an ergodic dynamical 
system. 
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An ergodic dynamical system is one for which, apart from the energy — measured by the Hamiltonian — there are no 
other functionally independent conserved quantities on the phase space. More explicitly, suppose that the energy E is 
fixed, and let Q £ be the subset of the phase space consisting of all states of energy E (an energy surface), and let !T 
denote the evolution operator on the phase space. The dynamical system is ergodic if there are no continuous 

non-constant functions onH, such that 

E 

f(T t w) = f(w) 

for all w on Q and all time t. Liouville's theorem implies that there exists a measure [i on the energy surface that is 

invariant under the time translation. As a result, time translation is a unitary transformation of the Hilbert space 

2 

L (£l E ,\i) consisting of square-integrable functions on the energy surface Q with respect to the inner product 

{f,g)L*(n B3 p) = / fgdp> 

J E 

The von Neumann mean ergodic theorem^ states the following: 

• If U is a (strongly continuous) one-parameter semigroup of unitary operators on a Hilbert space H, and P is the 
orthogonal projection onto the space of common fixed points of £/ , {xGH I Ux = x for all t > 0}, then 

1 f T 

Px— lim — / U t xdt. 

T^oc T Jo 

For an ergodic system, the fixed set of the time evolution consists only of the constant functions, so the ergodic 

[32] 2 
theorem implies the following: for any function /G L (Q. 9 \i), 

L 2 -limi f f(T t w)dt= f f(y)dfi(y). 

That is, the long time average of an observable /is equal to its expectation value over an energy surface. 

Fourier analysis 



One of the basic goals of Fourier analysis is to decompose a function 
into a (possibly infinite) linear combination of given basis functions: 
the associated Fourier series. The classical Fourier series associated to 
a function /defined on the interval [0,1] is a series of the form 




Superposition of sinusoidal wave basis functions 
(bottom) to form a sawtooth wave (top) 
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Spherical harmonics, an orthonormal basis for the 
Hilbert space of square-integrable functions on 
the sphere, shown graphed along the radial 
direction 



71= — OO 

where 



a n = f f{0)t 

JO 



The example of adding up the first few terms in a Fourier series for a sawtooth function is shown in the figure. The 
basis functions are sine waves with wavelengths TJn (n=integer) shorter than the wavelength X of the sawtooth itself 
(except for n=l, the fundamental wave). All basis functions have nodes at the nodes of the sawtooth, but all but the 
fundamental have additional nodes. The oscillation of the summed terms about the sawtooth is called the Gibbs 
phenomenon. 

A significant problem in classical Fourier series asks in what sense the Fourier series converges, if at all, to the 

function/. Hilbert space methods provide one possible answer to this question. The functions e (9) = e mn form 

2 n 
an orthogonal basis of the Hilbert space L ([0,1]). Consequently, any square-integrable function can be expressed as 



a series 



f(0) = J2a n e n (9) : a n = (/, e n ) 



2 

and, moreover, this series converges in the Hilbert space sense (that is, in the L mean). 

The problem can also be studied from the abstract point of view: every Hilbert space has an orthonormal basis, and 
every element of the Hilbert space can be written in a unique way as a sum of multiples of these basis elements. The 
coefficients appearing on these basis elements are sometimes known abstractly as the Fourier coefficients of the 

element of the space P 4 ^ The abstraction is especially useful when it is more natural to use different basis functions 

2 

for a space such as L ([0,1]). In many circumstances, it is desirable not to decompose a function into trigonometric 

T351 

functions, but rather into orthogonal polynomials or wavelets for instance, and in higher dimensions into spherical 
harmonics P 6 ^ 

2 2 
For instance, if e are any orthonormal basis functions of L [0,1], then a given function in L [0,1] can be 

n T371 
approximated as a finite linear combination 

fix) n f n (x) = a^e^x) + a 2 e 2 (x) H h a n e n (x) 

2 

The coefficients {a.} are selected to make the magnitude of the difference \\f - f\\ as small as possible. 
Geometrically, the best approximation is the orthogonal projection of / onto the subspace consisting of all linear 

R81 

combinations of the {e.}, and can be calculated by 



dj = J ej(x)f(x)dx. 



2 

That this formula minimizes the difference Wf-f II is a consequence of Bessel's inequality and Parseval's formula. 
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In various applications to physical problems, a function can be decomposed into physically meaningful 
eigenfunctions of a differential operator (typically the Laplace operator): this forms the foundation for the spectral 
study of functions, in reference to the spectrum of the differential operator P 9 ^ A concrete physical application 
involves the problem of hearing the shape of a drum: given the fundamental modes of vibration that a drumhead is 
capable of producing, can one infer the shape of the drum itself? ^ The mathematical formulation of this question 
involves the Dirichlet eigenvalues of the Laplace equation in the plane, that represent the fundamental modes of 
vibration in direct analogy with the integers that represent the fundamental modes of vibration of the violin string. 

Spectral theory also underlies certain aspects of the Fourier transform of a function. Whereas Fourier analysis 
decomposes a function defined on a compact set into the discrete spectrum of the Laplacian (which corresponds to 
the vibrations of a violin string or drum), the Fourier transform of a function is the decomposition of a function 
defined on all of Euclidean space into its components in the continuous spectrum of the Laplacian. The Fourier 
transformation is also geometrical, in a sense made precise by the Plancherel theorem, that asserts that it is an 
isometry of one Hilbert space (the "time domain") with another (the "frequency domain"). This isometry property of 
the Fourier transformation is a recurring theme in abstract harmonic analysis, as evidenced for instance by the 
Plancherel theorem for spherical functions occurring in noncommutative harmonic analysis. 




Quantum mechanics 



In the mathematically rigorous formulation of quantum mechanics, 
developed by Paul Dirac^ and John von Neumann ^ , the possible 
states (more precisely, the pure states) of a quantum mechanical system 
are represented by unit vectors (called state vectors) residing in a 
complex separable Hilbert space, known as the state space, well 
defined up to a complex number of norm 1 (the phase factor). In other 
words, the possible states are points in the projectivization of a Hilbert 
space, usually called the complex projective space. The exact nature of 
this Hilbert space is dependent on the system; for example, the position 
and momentum states for a single non-relativistic spin zero particle is 
the space of all square-integrable functions, while the states for the 
spin of a single proton are unit elements of the two-dimensional 
complex Hilbert space of spinors. Each observable is represented by a 
self-adjoint linear operator acting on the state space. Each eigenstate of 

an observable corresponds to an eigenvector of the operator, and the associated eigenvalue corresponds to the value 
of the observable in that eigenstate. 



The orbitals of an electron in a hydrogen atom are 
eigenfunctions of the energy. 



The time evolution of a quantum state is described by the Schrodinger equation, in which the Hamiltonian, the 
operator corresponding to the total energy of the system, generates time evolution. 

The inner product between two state vectors is a complex number known as a probability amplitude. During an ideal 
measurement of a quantum mechanical system, the probability that a system collapses from a given initial state to a 
particular eigenstate is given by the square of the absolute value of the probability amplitudes between the initial and 
final states. The possible results of a measurement are the eigenvalues of the operator — which explains the choice of 
self-adjoint operators, for all the eigenvalues must be real. The probability distribution of an observable in a given 
state can be found by computing the spectral decomposition of the corresponding operator. 

For a general system, states are typically not pure, but instead are represented as statistical mixtures of pure states, or 
mixed states, given by density matrices: self-adjoint operators of trace one on a Hilbert space. Moreover, for general 
quantum mechanical systems, the effects of a single measurement can influence other parts of a system in a manner 
that is described instead by a positive operator valued measure. Thus the structure both of the states and observables 
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in the general theory is considerably more complicated than the idealization for pure states. 

Heisenberg's uncertainty principle is represented by the statement that the operators corresponding to certain 
observables do not commute, and gives a specific form that the commutator must have. 



Properties 
Pythagorean identity 

Two vectors u and v in a Hilbert space H are orthogonal when (u,v) =0. The notation for this is u J_ v. More 
generally, when S is a subset in H, the notation u ± S means that u is orthogonal to every element from S. 
When u and v are orthogonal, one has 

\\u + v\\ 2 = {u + u + v) = (u, u) + 2 Re(ii, v) + (v, v) = ||ii|| 2 + ||^|| 2 . 
By induction on n, this is extended to any family u ,...,u of n orthogonal vectors, 



ui H h ^|| 2 = ||tii|| 2 H h ||ie, 




Whereas the Pythagorean identity as stated is valid in any inner product space, completeness is required for the 
extension of the Pythagorean identity to series. A series 2 of orthogonal vectors converges in H if and only if the 
series of squares of norms converges, and 

oc oo 

IIX^II 2 = Eii^ii 2 - 

k=0 fc=0 

Furthermore, the sum of a series of orthogonal vectors is independent of the order in which it is taken. 



Parallelogram identity and polarization 



By definition, every Hilbert space is also a Banach space. Furthermore, 
in every Hilbert space the following parallelogram identity holds: 




Geometrically, the parallelogram identity asserts 



that AC 2 + BD 2 = 



2(AB 2 + AD 2 ). In words, the 



sum of the squares of the diagonals is twice the 
sum of the squares of any two adjacent sides. 



|| u + t ;|| 2 + || W -t,|| 2 = 2(H| 2 + ||i;|| 2 ). 
Conversely, every Banach space in which the parallelogram identity holds is a Hilbert space, and the inner product is 
uniquely determined by the norm by the polarization identity J 43 ^ For real Hilbert spaces, the polarization identity is 



{u, v) = - ^|| it + || 2 — \\u — i;|| 2 ^ 



4 

For complex Hilbert spaces, it is 

(Ujv) = -^\\u + v\\ 2 — \\u — v\\ 2 + i\\u + iv\\ 2 — i\\u — iv\\ 2 ^ . 
The parallelogram law implies that any Hilbert space is a uniformly convex Banach space J 44 ^ 
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Best approximation 

If C is a non-empty closed convex subset of a Hilbert space H and x a point in H, there exists a unique point y G C 
which minimizes the distance between x and points in c} 45 ^ 

y eC, \\x - y\\ = dist(x,C) = min{||x - z\\ : z G C}. 
This is equivalent to saying that there is a point with minimal norm in the translated convex set D = C - x. The proof 
consists in showing that every minimizing sequence (<i ) C D is Cauchy (using the parallelogram identity) hence 
converges (using completeness) to a point in D that has minimal norm. More generally, this holds in any uniformly 
convex Banach space J 46 ^ 

When this result is applied to a closed subspace F of H, it can be shown that the point y G F closest to x is 
characterized by'- 47 -' 

y£F, x-y±F. 

This point y is the orthogonal projection of x onto F, and the mapping P : x —> y is linear (see Orthogonal 

t 

complements and projections). This result is especially significant in applied mathematics, especially numerical 
analysis, where it forms the basis of least squares methods. 

In particular, when F is not equal to H, one can find a non-zero vector v orthogonal to F (select x not in F and v = x- 
y). A very useful criterion is obtained by applying this observation to the closed subspace F generated by a subset S 
ofH. 

A subset S of H spans a dense vector subspace if (and only if) the vector 0 is the sole vector v G H orthogonal 
to S. 

Duality 

The dual space H* is the space of all continuous linear functions from the space H into the base field. It carries a 
natural norm, defined by 

IMI = su p 

\\x\\=i,xeH 

This norm satisfies the parallelogram law, and so the dual space is also an inner product space. The dual space is also 
complete, and so it is a Hilbert space in its own right. 

The Riesz representation theorem affords a convenient description of the dual. To every element u of H, there is a 
unique element cp of H*, defined by 

tp u (x) = {x,u). 

The mapping U \— > ip u is an antilinear mapping from H to H* . The Riesz representation theorem states that this 

T481 * 

mapping is an antilinear isomorphism. Thus to every element cp of the dual H there exists one and only one in 
H such that 

{x,u v ) = tp{x) 
for all x G H. The inner product on the dual space H* satisfies 

The reversal of order on the right-hand side restores linearity in cp from the antilinearity of u . In the real case, the 
antilinear isomorphism from H to its dual is actually an isomorphism, and so real Hilbert spaces are naturally 
isomorphic to their own duals. 

The representing vector is obtained in the following way. When cp ± 0, the kernel F = ker cp is a closed vector 
subspace of H, not equal to H, hence there exists a non-zero vector v orthogonal to F. The vector u is a suitable 
scalar multiple Av of v. The requirement that ^(v) = (v, u) yields 



u = {v, v) 1 ip{v) v. 
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This correspondence cp <-» u is exploited by the bra-ket notation popular in physics. It is common in physics to 
assume that the inner product, denoted by (x\y), is linear on the right, 

(x\y) = (y,x). 

The result (x\y) can be seen as the action of the linear functional (x\ (the bra) on the vector \y) (the kef). 

The Riesz representation theorem relies fundamentally not just on the presence of an inner product, but also on the 
completeness of the space. In fact, the theorem implies that the topological dual of any inner product space can be 
identified with its completion. An immediate consequence of the Riesz representation theorem is also that a Hilbert 
space H is reflexive, meaning that the natural map from H into its double dual space is an isomorphism. 

Weakly convergent sequences 

In a Hilbert space H, a sequence {x } is weakly convergent to a vector xE// when 
lim(x n , v) = (x, v) 

71 

for every v £ H. 

For example, any orthonormal sequence {f^} converges weakly to 0, as a consequence of Bessel's inequality. Every 
weakly convergent sequence {x } is bounded, by the uniform boundedness principle. 

Conversely, every bounded sequence in a Hilbert space admits weakly convergent subsequences (Alaoglu's 
theorem)J 49] This fact may be used to prove minimization results for continuous convex functionals, in the same 
way that the Bolzano- Weierstrass theorem is used for continuous functions on R^. Among several variants, one 
simple statement is as follows: t50] 

Iff: H — » R is a convex continuous function such that/(x) tends to +°o when llxll tends to ©o, then /admits a 
minimum at some point x Q G H. 

This fact (and its various generalizations) are fundamental for direct methods in the calculus of variations. 
Minimization results for convex functionals are also a direct consequence of the slightly more abstract fact that 
closed bounded convex subsets in a Hilbert space H are weakly compact, since H is reflexive. The existence of 
weakly convergent subsequences is a special case of the Eberlein-Smulian theorem. 

Banach space properties 

Any general property of Banach spaces continues to hold for Hilbert spaces. The open mapping theorem states that a 
continuous surjective linear transformation from one Banach space to another is an open mapping meaning that it 
sends open sets to open sets. A corollary is the bounded inverse theorem, that a continuous and bijective linear 
function from one Banach space to another is an isomorphism (that is, a continuous linear map whose inverse is also 
continuous). This theorem is considerably simpler to prove in the case of Hilbert spaces than in general Banach 
spaces.^ ^ The open mapping theorem is equivalent to the closed graph theorem, which asserts that a function from 
one Banach space to another is continuous if and only if its graph is a closed set. In the case of Hilbert spaces, this 
is basic in the study of unbounded operators (see closed operator). 

The (geometrical) Hahn-Banach theorem asserts that a closed convex set can be separated from any point outside it 
by means of a hyperplane of the Hilbert space. This is an immediate consequence of the best approximation 
property: if y is the element of a closed convex set F closest to x, then the separating hyperplane is the plane 
perpendicular to the segment xy passing through its midpoint. 

The distortion problem on Hilbert space asks whether or not every real valued Lipschitz function / defined on the 
sphere of a separable and infinite dimensional Hilbert space JJ stabilizes on the sphere of an infinite dimensional 
subspace, i.e. whether there is a real number a G R so that for every 6 > 0 there is an infinite dimensional subspace 
Y°f H > so mat ' a_ /(y)'< f° r al l y ^ Y, with llyll=l. This problem was solved negatively by E. Odell and 
Th.Schlumprecht (1994). 
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Operators on Hilbert spaces 
Bounded operators 

The continuous linear operators A : —> from a Hilbert space to a second Hilbert space are bounded in 
the sense that they map bounded sets to bounded sets. Conversely, if an operator is bounded, then it is continuous. 
The space of such bounded linear operators has a norm, the operator norm given by 

\\A\\ = sup{ \\Ax\\ : < 1}. 

The sum and the composite of two bounded linear operators is again bounded and linear. For y in H^, the map that 
sends x G H to <Ax, y> is linear and continuous, and according to the Riesz representation theorem can therefore be 
represented in the form 

(x,A*y) = (Ax,y) 

for some vector A* y in This defines another bounded linear operator A* : H^—> H^ 9 the adjoint of A. One can see 
that A** = A. 

The set B(H) of all bounded linear operators on H, together with the addition and composition operations, the norm 
and the adjoint operation, is a C -algebra, which is a type of operator algebra. 

An element A of B(H) is called self -adjoint or Hermitian if A* = A. If A is Hermitian and (Ax, x) > 0 for every x, 
then A is called non-negative, written A > 0; if equality holds only when x = 0, then A is called positive. The set of 
self adjoint operators admits a partial order, in which A > B if A - B > 0. If A has the form B*B for some B, then A is 
non-negative; if B is invertible, then A is positive. A converse is also true in the sense that, for a non-negative 
operator A, there exists a unique non-negative square root B such that 

A = B 2 = B*B. 

In a sense made precise by the spectral theorem, self-adjoint operators can usefully be thought of as operators that 
are "real". An element A of B(H) is called normal if A*A = A A*. Normal operators decompose into the sum of a 
self-adjoint operators and an imaginary multiple of a self adjoint operator 

A + A* .{A- A*) 
A ~^^ + l 2i 

that commute with each other. Normal operators can also usefully be thought of in terms of their real and imaginary 
parts. 

An element U of B(H) is called unitary if U is invertible and its inverse is given by U*. This can also be expressed by 
requiring that U be onto and ( Ux, Uy) = (x, y) for all x and y in H. The unitary operators form a group under 
composition, which is the isometry group of H. 

An element of B(H) is compact if it sends bounded sets to relatively compact sets. Equivalently, a bounded operator 
Tis compact if, for any bounded sequence {jc }, the sequence {Tx } has a convergent subsequence. Many integral 
operators are compact, and in fact define a special class of operators known as Hilbert-Schmidt operators that are 
especially important in the study of integral equations. Fredholm operators are those which differ from a compact 
operator by a multiple of the identity, and are equivalently characterized as operators with a finite dimensional kernel 
and cokernel. The index of a Fredholm operator Tis defined by 

index T = dim ker T — dim coker T. 

The index is homotopy invariant, and plays a deep role in differential geometry via the Atiyah-Singer index 
theorem. 
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Unbounded operators 

Unbounded operators are also tractable in Hilbert spaces, and have important applications to quantum mechanics P 4 ^ 
An unbounded operator T on a Hilbert space H is defined to be a linear operator whose domain D(T) is a linear 
subspace of H. Often the domain D(T) is a dense subspace of H, in which case T is known as a densely-defined 
operator. 

The adjoint of a densely defined unbounded operator is defined in essentially the same manner as for bounded 
operators. Self-adjoint unbounded operators play the role of the observable s in the mathematical formulation of 
quantum mechanics. Examples of self-adjoint unbounded operators on the Hilbert space L 2 (R) are:'" 55 -' 

• A suitable extension of the differential operator 

(Af)(x)=i±f(x), 

where / is the imaginary unit and /is a differentiable function of compact support. 

• The multiplication-by-x operator: 

(Bf)(x) = xf(x). 

These correspond to the momentum and position observables, respectively. Note that neither A nor B is defined on 

all of H, since in the case of A the derivative need not exist, and in the case of B the product function need not be 

2 

square integrable. In both cases, the set of possible arguments form dense subspaces of L (R). 



Constructions 
Direct sums 

Two Hilbert spaces and H 2 can be combined into another Hilbert space, called the (orthogonal) direct sum, t56] 
and denoted 

h x e h 2 , 

consisting of the set of all ordered pairs (x^, x^) where x. G H., i = 1,2, and inner product defined by 
More generally, if H. is a family of Hilbert spaces indexed by / G /, then the direct sum of the H., denoted 
iei 

consists of the set of all indexed families 

x=(x i £H i \i£l)£l[H i 

iei 

in the Cartesian product of the H such that 

^ \\xi\\ 2 < OC. 
iei 

The inner product is defined by 
iei 

Each of the H. is included as a closed subspace in the direct sum of all of the H.. Moreover, the H. are pairwise 
orthogonal. Conversely, if there is a system of closed subspaces V , i G /, in a Hilbert space H which are pairwise 
orthogonal and whose union is dense in H, then H is canonically isomorphic to the direct sum of V.. In this case, H is 
called the internal direct sum of the V. A direct sum (internal or external) is also equipped with a family of 
orthogonal projections E. onto the ith direct summand These projections are bounded, self-adjoint, idempotent 
operators which satisfy the orthogonality condition 
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EkE 3 = 0, i± j. 

The spectral theorem for compact self-adjoint operators on a Hilbert space H states that H splits into an orthogonal 
direct sum of the eigenspaces of an operator, and also gives an explicit decomposition of the operator as a sum of 
projections onto the eigenspaces. The direct sum of Hilbert spaces also appears in quantum mechanics as the Fock 
space of a system containing a variable number of particles, where each Hilbert space in the direct sum corresponds 
to an additional degree of freedom for the quantum mechanical system. In representation theory, the Peter- Weyl 
theorem guarantees that any unitary representation of a compact group on a Hilbert space splits as the direct sum of 
finite-dimensional representations . 

Tensor products 

If if 1 and H^, then one defines an inner product on the (ordinary) tensor product as follows. On simple tensors, let 

This formula then extends by sesquilinearity to an inner product on Hi ® fl*2- The Hilbertian tensor product of 
and if , sometimes denoted by H\®H2> * s me Hilbert space obtained by completing H\ ® i?2f° r me metric 
associated to this inner product. 

2 2 
An example is provided by the Hilbert space L ([0, 1]). The Hilbertian tensor product of two copies of L ([0, 1]) is 

2 2 2 

isometrically and linearly isomorphic to the space L ([0, 1] ) of square-integrable functions on the square [0, 1] . 

This isomorphism sends a simple tensor fi ® /j^ 0 me function 

(s,t) » Ms) f 2 (t) 

on the square. 

T581 

This example is typical in the following sense. Associated to every simple tensor product X\ ® rz^is the rank 
one operator 

x* € H* — > x*{xi) X2 

from the (continuous) dual if * to if . This mapping defined on simple tensors extends to a linear identification 
between Hi ® i?2 an d me space of finite rank operators from H* to H^. This extends to a linear isometry of the 
Hilbertian tensor product i^gj/i^with me Hilbert space HS{H*, of Hilbert-Schmidt operators from if * to 

Orthonormal bases 

The notion of an orthonormal basis from linear algebra generalizes over to the case of Hilbert spaces. In a Hilbert 
space if, an orthonormal basis is a family {e k ) k e B of elements of H satisfying the conditions: 

1 . Orthogonality: Every two different elements of B are orthogonal: (e , e .)= 0 for all k, j in B with k * j. 

2. Normalization: Every element of the family has norm 1:11^11 = 1 for all k in B. 

3. Completeness: The linear span of the family e , k G B, is dense in H. 

A system of vectors satisfying the first two conditions basis is called an orthonormal system or an orthonormal set 
(or an orthonormal sequence if B is countable). Such a system is always linearly independent. Completeness of an 
orthonormal system of vectors of a Hilbert space can be equivalently restated as: 

if (v, e^j = 0 for all k € B and some v G if then v = 0. 

This is related to the fact that the only vector orthogonal to a dense linear subspace is the zero vector, for if S is any 
orthonormal set and v is orthogonal to S, then v is orthogonal to the closure of the linear span of S, which is the 
whole space. 

Examples of orthonormal bases include: 

• the set {(1,0,0), (0,1,0), (0,0,1)} forms an orthonormal basis of R with the dot product; 
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2 

• the sequence [f^ : n G Z} with/^(x) = exp(2jtmx) forms an orthonormal basis of the complex space L ([0,1]); 

In the infinite-dimensional case, an orthonormal basis will not be a basis in the sense of linear algebra; to distinguish 
the two, the latter basis is also called a Hamel basis. That the span of the basis vectors is dense implies that every 
vector in the space can be written as the sum of an infinite series, and the orthogonality implies that this 
decomposition is unique. 

Sequence spaces 

2 

The space D of square- summable sequences of complex numbers has an orthonormal basis 
ei = (l,0,0,...) 
e 2 = (0,l,0,...) 

More generally, if B is any set, then one can form a Hilbert space of sequences with index set B, defined by 

£ 2 (B) = {x : B^C | \*( b )\ 2 < °°}- 

beB 

The summation over B is here defined by 

Y,\*(b)\ 2 =su P f:\x(b n )\i 

beB n=i 

the supremum being taken over all finite subsets of B. It follows that, in order for this sum to be finite, every element 

2 

of D (B) has only countably many nonzero terms. This space becomes a Hilbert space with the inner product 



v) = S x ( b )y( b ) 



beB 

2 

for all x and y in D (B). Here the sum also has only countably many nonzero terms, and is unconditionally convergent 

by the Cauchy-Schwarz inequality. 

2 

An orthonormal basis of D (B) is indexed by the set B, given by 



e b (b f ) = 



1 if b = b' 
0 otherwise. 



Bessel's inequality and Parseval's formula 

Let/ 1? . . .,f n be a finite orthonormal system in H. For an arbitrary vector x in H, let 

71 

i=i 

Then {x,f^ = f° r every k = 1, . . ., n. It follows that x - y is orthogonal to each/^, hence x - y is orthogonal to y. 
Using the Pythagorean identity twice, it follows that 

||x|| 2 = ||^ - + II2/H 2 > ||y|| 2 = f: |{^ /: ,->| 2 . 

Let {f. },/€/, be an arbitrary orthonormal system in H. Applying the preceding inequality to every finite subset / of 
/ gives the Bessel inequality^ 

Y,\(x,m 2 <\\x\\\ xGH 
iei 

(according to the definition of the sum of an arbitrary family of non-negative real numbers). 

Geometrically, Bessel's inequality implies that the orthogonal projection of x onto the linear subspace spanned by the 
f. has norm that does not exceed that of x. In two dimensions, this is the assertion that the length of the leg of a right 
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triangle may not exceed the length of the hypotenuse. 

Bessel's inequality is a stepping stone to the more powerful Parseval identity which governs the case when Bessel's 
inequality is actually an equality. If {e^ k £ ^ is an orthonormal basis of H, then every element x of H may be written 

as 

keB 

Even if B is uncountable, Bessel's inequality guarantees that the expression is well-defined and consists only of 
countably many nonzero terms. This sum is called the Fourier expansion of x, and the individual coefficients (x,e^ 
are the Fourier coefficients of x. Parseval's formula is then 

Hx|| 2 = eimi 2 . 

Conversely, if {e^\ is an orthonormal set such that Parseval's identity holds for every x, then {e^\ is an orthonormal 
basis. 



Hilbert dimension 

As a consequence of Zorn's lemma, every Hilbert space admits an orthonormal basis; furthermore, any two 
orthonormal bases of the same space have the same cardinality, called the Hilbert dimension of the space/ 61] For 

2 

instance, since D (B) has an orthonormal basis indexed by B, its Hilbert dimension is the cardinality of B (which may 
be a finite integer, or a countable or uncountable cardinal number). 

2 

As a consequence of Parseval's identity, if {e^ k g B is an orthonormal basis of H, then the map O : H — » £ (B) 
defined by = (( x > e ])) keB * s an isometric isomorphism of Hilbert spaces: it is a bijective linear mapping such that 

(^(x)^(y)) £ 2 {B) = {x,y) H 
for all x and y in H. The cardinal number of B is the Hilbert dimension of H. Thus every Hilbert space is 
isometrically isomorphic to a sequence space £ 2 {B) for some set B. 

Separable spaces 

A Hilbert space is separable if and only if it admits a countable orthonormal basis. All infinite-dimensional separable 
Hilbert spaces are therefore isometrically isomorphic to . 

In the past, Hilbert spaces were often required to be separable as part of the definition J 62 ^ Most spaces used in 
physics are separable, and since these are all isomorphic to each other, one often refers to any infinite-dimensional 
separable Hilbert space as "the Hilbert space" or just "Hilbert space" J 63 ^ Even in quantum field theory, most of the 
Hilbert spaces are in fact separable, as stipulated by the Wightman axioms. However, it is sometimes argued that 
non-separable Hilbert spaces are also important in quantum field theory, roughly because the systems in the theory 
possess an infinite number of degrees of freedom and any infinite Hilbert tensor product (of spaces of dimension 
greater than one) is non- separable J 64 ^ For instance, a bosonic field can be naturally thought of as an element of a 
tensor product whose factors represent harmonic oscillators at each point of space. From this perspective, the natural 
state space of a boson might seem to be a non-separable space J 64 ^ However, it is only a small separable subspace of 
the full tensor product that can contain physically meaningful fields (on which the observables can be defined). 
Another non- separable Hilbert space models the state of an infinite collection of particles in an unbounded region of 
space. An orthonormal basis of the space is indexed by the density of the particles, a continuous parameter, and since 
the set of possible densities is uncountable, the basis is not countable J 64 ^ 
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Orthogonal complements and projections 

If S is a subset of a Hilbert space H, the set of vectors orthogonal to S is defined by 

S 1 - = {x G H : (x, 5) = 0 Vs G 5} . 

is a closed subspace of H and so forms itself a Hilbert space. If V is a closed subspace of H, then V 1 is called the 
orthogonal complement of V. In fact, every xinH can then be written uniquely as x = v + w, with v in V and w in 
Therefore, 7f is the internal Hilbert direct sum of V and V 1 . 

The linear operator : H — » // which maps x to v is called the orthogonal projection onto V. There is a natural 
one-to-one correspondence between the set of all closed subspaces of H and the set of all bounded self-adjoint 

2 

operators P such that P =P. Specifically, 

Theorem. The orthogonal projection P y is a self-adjoint linear operator on H of norm < 1 with the property 

2 2 
P = P^. Moreover, any self-adjoint linear operator E such that E = E is of the form P^, where V is the range 

-vll. 

[65] 



of E. For every x in H, P is the unique element v of V which minimizes the distance \\x - vll. 



This provides the geometrical interpretation of P^(x): it is the best approximation to x by elements of V. 

An operator P such that P = P 2 = P* is called an orthogonal projection. The orthogonal projection P y onto a closed 
subspace V of H is the adjoint of the inclusion mapping 

i v : V -> ff, 
meaning that 

<i v x, 2/) = (x, iVy) 

for all x G // and y G V. Projections and P y are called mutually orthogonal if PjjPy = 0- This is equivalent to U 
and V being orthogonal as subspaces of H. As a result, the sum of the two projections P^and P^is only a projection 
if U and V are orthogonal to each other, and in that case P u + P y = P u+y - The composite P^Py is generally not a 
projection; in fact, the composite is a projection if and only if the two projections commute, and in that case 

P U P V = P UnV 

The operator norm of a projection P onto a non-zero closed subspace is equal to one: 
llPxIl 

||jP|| = sup = 1. 

xEH,x^0 \\x\\ 

2 

Every closed subspace V of a Hilbert space is therefore the image of an operator P of norm one such that P = P. In 
fact this property characterizes Hilbert spaces 

• A Banach space of dimension higher than 2 is (isometrically) a Hilbert space if and only if, to every closed 
subspace V, there is an operator P y of norm one whose image is V such that Py = P v . 

While this result characterizes the metric structure of a Hilbert space, the structure of a Hilbert space as a topological 
vector space can itself be characterized in terms of the presence of complementary subspaces: ^ 

• A Banach space X is topologically and linearly isomorphic to a Hilbert space if and only if, to every closed 
subspace V, there is a closed subspace W such that X is equal to the internal direct sum V © W • 

The orthogonal complement satisfies some more elementary results. It is a monotone function in the sense that if 
U C V , then V 1 " C U 1 " with equality holding if and only if V is contained in the closure of U. This result is a 
special case of the Hahn-Banach theorem. The closure of a subspace can be completely characterized in terms of the 
orthogonal complement: If V is a subspace of H, then the closure of V is equal to y^ 1 - • The orthogonal 
complement is thus a Galois connection on the partial order of subspaces of a Hilbert space. In general, the 
orthogonal complement of a sum of subspaces is the intersection of the orthogonal complements:^ 68 ^ 
(Ei Vi) 1 ' = Hi Vi~ • If the V i are in addition closed, then = (Hi ■ 
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Spectral theory 

There is a well-developed spectral theory for self-adjoint operators in a Hilbert space, that is roughly analogous to 
the study of symmetric matrices over the reals or self-adjoint matrices over the complex numbers J 69 ^ In the same 
sense, one can obtain a "diagonalization" of a self-adjoint operator as a suitable sum (actually an integral) of 
orthogonal projection operators. 

The spectrum of an operator T, denoted o(T) is the set of complex numbers X such that T-X lacks a continuous 
inverse. If T is bounded, then the spectrum is always a compact set in the complex plane, and lies inside the disc 
H<||T||. If Tis self-adjoint, then the spectrum is real. In fact, it is contained in the interval [m,M] where 

m — inf (Tx, x), M— sup (Tx,x). 
11*11=1 H|=i 

Moreover, m and M are both actually contained within the spectrum. 
The eigenspaces of an operator T are given by 
H\ = ker(T — A). 

Unlike with finite matrices, not every element of the spectrum of T must be an eigenvalue: the linear operator T - X 
may only lack an inverse because it is not surjective. Elements of the spectrum of an operator in the general sense are 
known as spectral values. Since spectral values need not be eigenvalues, the spectral decomposition is often more 
subtle than in finite dimensions. 

However, the spectral theorem of a self-adjoint operator T takes a particularly simple form if, in addition, T is 
assumed to be a compact operator. The spectral theorem for compact self-adjoint operators states: ^ 

• A compact self-adjoint operator Thas only countably (or finitely) many spectral values. The spectrum of T has no 
limit point in the complex plane except possibly zero. The eigenspaces of T decompose H into an orthogonal 
direct sum: 

\E<r(T) 

Moreover, if E denotes the orthogonal projection onto the eigenspace H , then 

A, A 

T= £ XE X 

where the sum converges with respect to the norm on B(H). 

This theorem plays a fundamental role in the theory of integral equations, as many integral operators are compact, in 
particular those that arise from Hilbert- Schmidt operators. 

The general spectral theorem for self-adjoint operators involves a kind of operator- valued Riemann-Stieltjes 
integral, rather than an infinite summation. 1 The spectral family associated to T associates to each real number X an 
operator E^, which is the projection onto the nullspace of the operator (T — A) + , where the positive part of a 
self-adjoint operator is defined by 

The operators E are monotone increasing relative to the partial order defined on self-adjoint operators; the 

A 

eigenvalues correspond precisely to the jump discontinuities. One has the spectral theorem, which asserts 

T= I XdE x . 

Jr 

The integral is understood as a Riemann-Stieltjes integral, convergent with respect to the norm on B(//). In 
particular, one has the ordinary scalar- valued integral representation 

(Tx : y) = / Xd{E x x,y). 
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A somewhat similar spectral decomposition holds for normal operators, although because the spectrum may now 
contain non-real complex numbers, the operator-valued Stieltjes measure dE. must instead be replaced by a 

A 

resolution of the identity. 

A major application of spectral methods is the spectral mapping theorem, which allows one to apply to a self-adjoint 
operator T any continuous complex function /defined on the spectrum of T by forming the integral 

f(T)= f f(X)dE x . 

Ja(T) 

T721 

The resulting continuous functional calculus has applications in particular to pseudodifferential operators. 

The spectral theory of unbounded self-adjoint operators is only marginally more difficult than for bounded operators. 
The spectrum of an unbounded operator is defined in precisely the same way as for bounded operators: X is a spectral 
value if the resolvent operator 

R\ = (T - A) -1 

fails to be a well-defined continuous operator. The self-adjointness of T still guarantees that the spectrum is real. 
Thus the essential idea of working with unbounded operators is to look instead at the resolvent R where X is 
non-real. This is a bounded normal operator, which admits a spectral representation that can then be transferred to a 
spectral representation of T itself. A similar strategy is used, for instance, to study the spectrum of the Laplace 
operator: rather than address the operator directly, one instead looks as an associated resolvent such as a Riesz 
potential or Bessel potential. 

T731 

A precise version of the spectral theorem which holds in this case is: 

Given a densely-defined self-adjoint operator T on a Hilbert space H, there corresponds a unique resolution of 
the identity E on the Borel sets of R, such that 

(Tx,y)= [ XdE x , y (X) 
Jr 

for all xG D(T) and y €H. The spectral measure E is concentrated on the spectrum of T. 
There is also a version of the spectral theorem that applies to unbounded normal operators. 
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Von Neumann algebra 

In mathematics, a von Neumann algebra or W*-algebra is a *-algebra of bounded operators on a Hilbert space that 
is closed in the weak operator topology and contains the identity operator. They were originally introduced by John 
von Neumann, motivated by the study of single operators, group representations, ergodic theory and quantum 
mechanics. His double commutant theorem shows that the analytic definition is equivalent to a purely algebraic 
definition as an algebra of symmetries. 

Two basic examples of von Neumann algebras are as follows. The ring L°°(R) of essentially bounded measurable 

functions on the real line is a commutative von Neumann algebra, which acts by pointwise multiplication on the 

2 

Hilbert space L (R) of square integrable functions. The algebra B{H) of all bounded operators on a Hilbert space H is 
a von Neumann algebra, non-commutative if the Hilbert space has dimension at least 2. 

Von Neumann algebras were first studied by von Neumann (1929); he and Francis Murray developed the basic 
theory, under the original name of rings of operators, in a series of papers written in the 1930s and 1940s (F.J. 
Murray & J. von Neumann 1936, 1937, 1943; J. von Neumann 1938, 1940, 1943, 1949), reprinted in the collected 
works of von Neumann (1961). 

Introductory accounts of von Neumann algebras are given in the online notes of Jones (2003) and Wassermann 
(1991) and the books by Dixmier (1981), Schwartz (1967), Blackadar (2005) and Sakai (1971). The three volume 
work by Takesaki (1979) gives an encyclopedic account of the theory. The book by Connes (1994) discusses more 
advanced topics. 

Definitions 

There are three common ways to define von Neumann algebras. 

The first and most common way is to define them as weakly closed * algebras of bounded operators (on a Hilbert 
space) containing the identity. In this definition the weak (operator) topology can be replaced by many other 
common topologies including the strong, ultrastrong or ultraweak operator topologies. The *-algebras of bounded 
operators that are closed in the norm topology are C*-algebras, so in particular any von Neumann algebra is a 
C*-algebra. 

The second definition is that a von Neumann algebra is a subset of the bounded operators closed under * and equal to 
its double commutant, or equivalently the commutant of some subset closed under *. The von Neumann double 
commutant theorem (von Neumann 1929) says that the first two definitions are equivalent. 

The first two definitions describe a von Neumann algebras concretely as a set of operators acting on some given 
Hilbert space. Sakai (1971) showed that von Neumann algebras can also be defined abstractly as C*-algebras that 
have a predual; in other words the von Neumann algebra, considered as a Banach space, is the dual of some other 
Banach space called the predual. The predual of a von Neumann algebra is in fact unique up to isomorphism. Some 
authors use "von Neumann algebra" for the algebras together with a Hilbert space action, and "W*-algebra" for the 
abstract concept, so a von Neumann algebra is a W*-algebra together with a Hilbert space and a suitable faithful 
unital action on the Hilbert space. The concrete and abstract definitions of a von Neumann algebra are similar to the 
concrete and abstract definitions of a C*-algebra, which can be defined either as norm-closed * algebras of operators 
on a Hilbert space, or as Banach *-algebras such that \\a a*\\=\\a\\ 
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Terminology 

Some of the terminology in von Neumann algebra theory can be confusing, and the terms often have different 
meanings outside the subject. 

• A factor is a von Neumann algebra with trivial center, i.e. a center consisting only of scalar operators. 

• A finite von Neumann algebra is one which is the direct integral of finite factors. Similarly, properly infinite von 
Neumann algebras are the direct integral of properly infinite factors. 

• A von Neumann algebra that acts on a separable Hilbert space is called separable. Note that such algebras are 
rarely separable in the norm topology. 

• The von Neumann algebra generated by a set of bounded operators on a Hilbert space is the smallest von 
Neumann algebra containing all those operators. 

• The tensor product of two von Neumann algebras acting on two Hilbert spaces is defined to be the von 
Neumann algebra generated by their algebraic tensor product, considered as operators on the Hilbert space tensor 
product of the Hilbert spaces. 

By forgetting about the topology on a von Neumann algebra, we can consider it a (unital) *-algebra, or just a ring. 
Von Neumann algebras are semihereditary: every finitely generated submodule of a projective module is itself 
projective. There have been several attempts to axiomatize the underlying rings of von Neumann algebras, including 
Baer *-rings and AW* algebras. The *-algebra of affiliated operators of a finite von Neumann algebra is a von 
Neumann regular ring. (The von Neumann algebra itself is in general not von Neumann regular.) 

Commutative von Neumann algebras 

Main article: Abelian von Neumann algebra 

The relationship between commutative von Neumann algebras and measure spaces is analogous to that between 
commutative C* -algebras and locally compact Hausdorff spaces. Every commutative von Neumann algebra is 
isomorphic to L°°(X) for some measure space (X, \i) and conversely, for every a-finite measure space X, the * algebra 
L°°(X) is a von Neumann algebra. 

Due to this analogy, the theory of von Neumann algebras has been called noncommutative measure theory, while the 
theory of C*-algebras is sometimes called noncommutative topology (Connes 1994). 

Projections 

Operators E in a von Neumann algebra for which E = EE = E* are called projections; they are exactly the operators 
which give an orthogonal projection of H onto some closed subspace. A subspace of the Hilbert space H is said to 
belong to the von Neumann algebra M if it is the image of some projection in M. Informally these are the closed 
subspaces that can be described using elements of M, or that M "knows" about. The closure of the image of any 
operator in M, or the kernel of any operator in M belong to M, and the closure of the image of any subspace 
belonging to M under an operator of M also belongs to M. There is a 1:1 correspondence between projections of M 
and subspaces that belong to it. 

The basic theory of projections was worked out by Murray & von Neumann (1936). Two subspaces belonging to M 
are called (Murray-von Neumann) equivalent if there is a partial isometry mapping the first isomorphically onto 
the other that is an element of the von Neumann algebra (informally, if M "knows" that the subspaces are 
isomorphic). This induces a natural equivalence relation on projections by defining E to be equivalent to F if the 
corresponding subspaces are equivalent, or in other words if there is a partial isometry of H that maps the image of E 
isometrically to the image of F and is an element of the von Neumann algebra. Another way of stating this is that E 
is equivalent to F if E=uu* and F-uu for some partial isometry u in M. 

The equivalence relation ~ thus defined is additive in the following sense: Suppose E^ ~ F^ and E^~ F^. If E^ J_ 
and F J_ F then E + E ~ F + F . This is not true in general if one requires unitary equivalence in the definition of 
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~, i.e. if we say E is equivalent to F if u*Eu = F for some unitary u. . 

The subspaces belonging to M are partially ordered by inclusion, and this induces a partial order < of projections. 
There is also a natural partial order on the set of equivalence classes of projections, induced by the partial order < of 
projections. If M is a factor, < is a total order on equivalence classes of projections, described in the section on traces 
below. 

A projection (or subspace belonging to M) E is said to he finite if there is no projection F < E that is equivalent to E. 
For example, all finite-dimensional projections (or subspaces) are finite (since isometries between Hilbert spaces 
leave the dimension fixed), but the identity operator on an infinite-dimensional Hilbert space is not finite in the von 
Neumann algebra of all bounded operators on it, since it is isometrically isomorphic to a proper subset of itself. 
However it is possible for infinite dimensional subspaces to be finite. 

Orthogonal projections are noncommutative analogues of indicator functions in L°°(R). L°°(R) is the ll ll^-closure of 
the subspace generated by the indicator functions. Similarly, a von Neumann algebra is generated by its projections; 
this is a consequence of the spectral theorem for self-adjoint operators. 

Factors 

A von Neumann algebra N whose center consists only of multiples of the identity operator is called a factor, von 
Neumann (1949) showed that every von Neumann algebra on a separable Hilbert space is isomorphic to a direct 
integral of factors. This decomposition is essentially unique. Thus, the problem of classifying isomorphism classes of 
von Neumann algebras on separable Hilbert spaces can be reduced to that of classifying isomorphism classes of 
factors. 

Murray & von Neumann (1936) showed that every factor has one of 3 types as described below. The type 
classification can be extended to von Neumann algebras that are not factors, and a von Neumann algebra is of type X 
if it can be decomposed as a direct integral of type X factors; for example, every commutative von Neumann algebra 
has type I . Every von Neumann algebra can be written uniquely as a sum of von Neumann algebras of types I, II, 
and III. 

There are several other ways to divide factors into classes that are sometimes used: 

• A factor is called discrete (or occasionally tame) if it has type I, and continuous (or occasionally wild) if it has 
type II or III. 

• A factor is called semifinite if it has type I or II, and purely infinite if it has type III. 

• A factor is called finite if the projection 1 is finite and properly infinite otherwise. Factors of types I and II may 
be either finite or properly infinite, but factors of type III are always properly infinite. 

Type I factors 

A factor is said to be of type I if there is a minimal projection E # 0, i.e. a projection E such that there is no other 
projection F with 0 < F < E. Any factor of type I is isomorphic to the von Neumann algebra of all bounded operators 
on some Hilbert space; since there is one Hilbert space for every cardinal number, isomorphism classes of factors of 
type I correspond exactly to the cardinal numbers. Since many authors consider von Neumann algebras only on 
separable Hilbert spaces, it is customary to call the bounded operators on a Hilbert space of finite dimension n a 
factor of type I , and the bounded operators on a separable infinite-dimensional Hilbert space, a factor of type I . 
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Type II factors 

A factor is said to be of type II if there are no minimal projections but there are non-zero finite projections. This 
implies that every projection E can be halved in the sense that there are equivalent projections F and G such that E = 
F + G. If the identity operator in a type II factor is finite, the factor is said to be of type 11^ otherwise, it is said to be 
of type 11^. The best understood factors of type II are the hyperfinite type II factor and the hyperfinite type 11^ 
factor, found by Murray & von Neumann (1936). These are the unique hyperfinite factors of types II 1 and IIj there 
are an uncountable number of other factors of these types that are the subject of intensive study. Murray & von 
Neumann (1937) proved the fundamental result that a factor of type II 1 has a unique finite tracial state, and the set of 
traces of projections is [0,1]. 

A factor of type II has a semifinite trace, unique up to rescaling, and the set of traces of projections is [0,°o]. The set 
of real numbers X such that there is an automorphism rescaling the trace by a factor of X is called the fundamental 
group of the type II factor. 

The tensor product of a factor of type II and an infinite type I factor has type 11^, and conversely any factor of type 
11^ can be constructed like this. The fundamental group of a type II 1 factor is defined to be the fundamental group 
of its tensor product with the infinite (separable) factor of type I. For many years it was an open problem to find a 
type II factor whose fundamental group was not the group of all positive reals, but Connes then showed that the von 
Neumann group algebra of a countable discrete group with Kazhdan's property T (the trivial representation is 
isolated in the dual space), such as SL (Z), has a countable fundamental group. Subsequently Sorin Popa showed 
that the fundamental group can be trivial for certain groups, including the semidirect product of Z by SL 2 (Z). 

An example of a type II factor is the von Neumann group algebra of a countable infinite discrete group such that 
every non-trivial conjugacy class is infinite. McDuff (1969) found an uncountable family of such groups with 
non-isomorphic von Neumann group algebras, thus showing the existence of uncountably many different separable 
type II 1 factors. 

Type III factors 

Lastly, type III factors are factors that do not contain any nonzero finite projections at all. In their first paper Murray 
& von Neumann (1936) were unable decide whether or not they existed; the first examples were later found by von 
Neumann (1940). Since the identity operator is always infinite in those factors, they were sometimes called type III^ 
in the past, but recently that notation has been superseded by the notation III., where X is a real number in the 
interval [0,1]. More precisely, if the Connes spectrum (of its modular group) is 1 then the factor is of type III 0 , if the 
Connes spectrum is all integral powers of X for 0 < X < 1 , then the type is III , and if the Connes spectrum is all 
positive reals then the type is III^ (The Connes spectrum is a closed subgroup of the positive reals, so these are the 
only possibilities.) The only trace on type III factors takes value °o on all non-zero positive elements, and any two 
non-zero projections are equivalent. At one time type III factors were considered to be intractable objects, but 
Tomita-Takesaki theory has led to a good structure theory. In particular, any type III factor can be written in a 
canonical way as the crossed product of a type II factor and the real numbers. 
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The predual 

Any von Neumann algebra M has a predual M^, which is the Banach space of all ultraweakly continuous linear 
functionals on M. As the name suggests, Mis (as a Banach space) the dual of its predual. The predual is unique in the 
sense that any other Banach space whose dual is M is canonically isomorphic to M^. Sakai (1971) showed that the 
existence of a predual characterizes von Neumann algebras among C* algebras. 

The definition of the predual given above seems to depend on the choice of Hilbert space that M acts on, as this 
determines the ultraweak topology. However the predual can also be defined without using the Hilbert space that M 
acts on, by defining it to be the space generated by all positive normal linear functionals on M. (Here "normal" 
means that it preserves suprema when applied to increasing nets of self adjoint operators; or equivalently to 
increasing sequences of projections.) 

The predual is a closed subspace of the dual M (which consists of all norm-continuous linear functionals on M) 
but is generally smaller. The proof that is (usually) not the same as M is nonconstructive and uses the axiom of 
choice in an essential way; it is very hard to exhibit explicit elements of M that are not in M^. For example, exotic 
positive linear forms on the von Neumann algebra f° (Z) are given by free ultrafilters; they correspond to exotic 
*-homomorphisms into C and describe the Stone-Cech compactification of Z. 

Examples: 

1. The predual of the von Neumann algebra L°°(R) of essentially bounded functions on R is the Banach space L l (R) 
of integrable functions. The dual of L°°(R) is strictly larger than L*(R) For example, a functional on L°°(R) that 
extends the Dirac measure 6 Q on the closed subspace of bounded continuous functions C° b (R) cannot be 
represented as a function in L l (R). 

2. The predual of the von Neumann algebra B(H) of bounded operators on a Hilbert space H is the Banach space of 
all trace class operators with the trace norm IIAII= Tr(IAI). The Banach space of trace class operators is itself the 
dual of the C*-algebra of compact operators (which is not a von Neumann algebra). 

Weights, states, and traces 

Weights and their special cases states and traces are discussed in detail in (Takesaki 1979). 

• A weight oo on a von Neumann algebra is a linear map from the set of positive elements (those of the form a a) to 

[0,oo]. 

• A positive linear functional is a weight with co(l) finite (or rather the extension of oo to the whole algebra by 
linearity). 

• A state is a weight with co(l)=l. 

• A trace is a weight with oo(aa )=oo(a a) for all a. 

• A tracial state is a trace with co(l)=l. 

Any factor has a trace such that the trace of a non-zero projection is non-zero and the trace of a projection is infinite 
if and only if the projection is infinite. Such a trace is unique up to rescaling. For factors that are separable or finite, 
two projections are equivalent if and only if they have the same trace. The type of a factor can be read off from the 
possible values of this trace as follows: 

• Type 1^: 0, x, 2x, ....,nx for some positive x (usually normalized to be l/n or 1). 

• Type I : 0, x, 2x, ....,©<> for some positive x (usually normalized to be 1). 

• Type 11^ [0,x] for some positive x (usually normalized to be 1). 

• Type llj [0,oo]. 

• Type III: 0,«>. 

If a von Neumann algebra acts on a Hilbert space containing a norm 1 vector v, then the functional a —> (av,v) is a 
normal state. This construction can be reversed to give an action on a Hilbert space from a normal state: this is the 
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GNS construction for normal states. 

Modules over a factor 

Given an abstract separable factor, one can ask for a classification of its modules, meaning the separable Hilbert 
spaces that it acts on. The answer is given as follows: every such module H can be given an M-dimension dim^(7f) 
(not its dimension as a complex vector space) such that modules are isomorphic if and only if they have the same 
M-dimension. The M-dimension is additive, and a module is isomorphic to a subspace of another module if and only 
if it has smaller or equal M-dimension. 

A module is called standard if it has a cyclic separating vector. Each factor has a standard representation, which is 
unique up to isomorphism. The standard representation has an antilinear involution / such that JMJ = M'. For finite 
factors the standard module is given by the GNS construction applied to the unique normal tracial state and the 
M-dimension is normalized so that the standard module has M-dimension 1, while for infinite factors the standard 
module is the module with M-dimension equal to ©o. 

The possible M-dimensions of modules are given as follows: 

• Type I (n finite): The M-dimension can be any of 0/n, 1/n, 21 n, 3/n, <*>. The standard module has M-dimension 

n ^ 
1 (and complex dimension n .) 

• Type I The M-dimension can be any of 0, 1, 2, 3, ©o. The standard representation of B(H) is H®H; its 
M-dimension is °o. 

• Type 11^: The M-dimension can be anything in [0, «>]. It is normalized so that the standard module has 
M-dimension 1 . The M-dimension is also called the coupling constant of the module H. 

• Type II Q : The M-dimension can be anything in [0, °o]. There is in general no canonical way to normalize it; the 
factor may have outer automorphisms multiplying the M-dimension by constants. The standard representation is 
the one with M-dimension ©o. 

• Type III: The M-dimension can be 0 or ©©. Any two non-zero modules are isomorphic, and all non-zero modules 
are standard. 

Amenable von Neumann algebras 

Connes (1976) and others proved that the following conditions on a von Neumann algebra M on a separable Hilbert 
space H are all equivalent: 

• M is hyperfinite or AFD or approximately finite dimensional or approximately finite: this means the algebra 
contains an ascending sequence of finite dimensional subalgebras with dense union. (Warning: some authors use 
"hyperfinite" to mean "AFD and finite".) 

• M is amenable: this means that the derivations of M with values in a normal dual Banach bimodule are all inner. 

• M has Schwartz's property P: for any bounded operator TonH the weak operator closed convex hull of the 
elements uTu contains an element commuting with M. 

• M is semidiscrete: this means the identity map from M to M is a weak pointwise limit of completely positive 
maps of finite rank. 

• M has property E or the Hakeda-Tomiyama extension property: this means that there is a projection of norm 1 
from bounded operators on H to M '. 

• M is injective: any completely positive linear map from any self adjoint closed subspace containing 1 of any 
unital C -algebra A to M can be extended to a completely positive map from A to M. 

There is no generally accepted term for the class of algebras above; Connes has suggested that amenable should be 
the standard term. 
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The amenable factors have been classified: there is a unique one of each of the types 1,1 , IL , II , III , for 0<k< 1, 

A J ^ n oo 1 oo a, 

and the ones of type III 0 correspond to certain ergodic flows. (For type III 0 calling this a classification is a little 
misleading, as it is known that there is no easy way to classify the corresponding ergodic flows.) The ones of type I 
and II 1 were classified by Murray & von Neumann (1943), and the remaining ones were classified by Connes (1976), 
except for the type III case which was completed by Haagerup. 

All amenable factors can be constructed using the group-measure space construction of Murray and von Neumann 
for a single ergodic transformation. In fact they are precisely the factors arising as crossed products by free ergodic 
actions of Z or on abelian von Neumann algebras L°°(X). Type I factors occur when the measure space X is atomic 
and the action transitive. When X is diffuse or non-atomic, it is equivalent to [0,1] as a measure space. Type II 
factors occur when X admits an equivalent finite (II ) or infinite (11^ ) measure, invariant under Z • Type III factors 
occur in the remaining cases where there is no invariant measure, but only an invariant measure class: these factors 
are called Krieger factors. 



Tensor products of von Neumann algebras 

The Hilbert space tensor product of two Hilbert spaces is the completion of their algebraic tensor product. One can 
define a tensor product of von Neumann algebras (a completion of the algebraic tensor product of the algebras 
considered as rings), which is again a von Neumann algebra, and act on the tensor product of the corresponding 
Hilbert spaces. The tensor product of two finite algebras is finite, and the tensor product of an infinite algebra and a 
non-zero algebra is infinite. The type of the tensor product of two von Neumann algebras (I, II, or III) is the 
maximum of their types. The commutation theorem for tensor products states that 

(M ® N)' = M'®N' 
(where M' denotes the commutant of M). 

The tensor product of an infinite number of von Neumann algebras, if done naively, is usually a ridiculously large 
non-separable algebra. Instead von Neumann (1938) showed that one should choose a state on each of the von 
Neumann algebras, use this to define a state on the algebraic tensor product, which can be used to product a Hilbert 
space and a (reasonably small) von Neumann algebra. Araki & Woods (1968) studied the case where all the factors 
are finite matrix algebras; these factors are called Araki- Woods factors or ITPFI factors (ITPFI stands for "infinite 
tensor product of finite type I factors"). The type of the infinite tensor product can vary dramatically as the states are 
changed; for example, the infinite tensor product of an infinite number of type \^ factors can have any type 
depending on the choice of states. In particular Powers (1967) found an uncountable family of non-isomorphic 
hyperfinite type III factors for 0<X<1, called Powers factors, by taking an infinite tensor product of type I factors, 

each with the state given by : x \— > Tr ( ^Tlr 1 ^ J x. 

V 0 A+l/ 

All hyperfinite von Neumann algebras not of type are isomorphic to Araki- Woods factors, but there are 
uncountably many of type III 0 that are not. 

Bimodules and subfactors 

A bimodule (or correspondence) is a Hilbert space H with module actions of two commuting von Neumann 
algebras. Bimodules have a much richer structure than that of modules. Any bimodule over two factors always gives 
a subfactor since one of the factors is always contained in the commutant of the other. There is also a subtle relative 
tensor product operation due to Connes on bimodules. The theory of subfactors, initiated by Vaughan Jones, 
reconciles these two seemingly different points of view. 

Bimodules are also important for the von Neumann group algebra M of a discrete group P . Indeed if V is any 

unitary representation of T, then, regarding fas the diagonal subgroup of Px T, the corresponding induced 

2 

representation on / ( F,V) is naturally a bimodule for two commuting copies of M. Important representation 
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theoretic properties of Tcan be formulated entirely in terms of bimodules and therefore make sense for the von 
Neumann algebra itself. For example Connes and Jones gave a definition of an analogue of Kazhdan's Property T for 
von Neumann algebras in this way. 

Non-amenable factors 

Von Neumann algebras of type I are always amenable, but for the other types there are an uncountable number of 
different non-amenable factors, which seem very hard to classify, or even distinguish from each other. Nevertheless 
Voiculescu has shown that the class of non-amenable factors coming from the group-measure space construction is 
disjoint from the class coming from group von Neumann algebras of free groups. Later Narutaka Ozawa proved that 
group von Neumann algebras of hyperbolic groups yield prime type II factors, i.e. ones that cannot be factored as 
tensor products of type II 1 factors, a result first proved by Leeming Ge for free group factors using Voiculescu' s free 
entropy. Popa's work on fundamental groups of non-amenable factors represents another significant advance. The 
theory of factors "beyond the hyperfinite" is rapidly expanding at present, with many new and surprising results; it 
has close links with rigidity phenomena in geometric group theory and ergodic theory. 

Examples 

• The essentially bounded functions on a a-finite measure space form a commutative (type I ) von Neumann 

2 

algebra acting on the L functions. For certain non-a-finite measure spaces, usually considered pathological, 
L°°(X) is not a von Neumann algebra; for example, the a-algebra of measurable sets might be the 
countable-cocountable algebra on an uncountable set. 

• The bounded operators on any Hilbert space form a von Neumann algebra, indeed a factor, of type I. 

• If we have any unitary representation of a group G on a Hilbert space H then the bounded operators commuting 
with G form a von Neumann algebra G', whose projections correspond exactly to the closed subspaces of H 
invariant under G. Equivalent subrepresentations correspond to equivalent projections in G'. The double 
commutant G" of G is also a von Neumann algebra. 

2 

• The von Neumann group algebra of a discrete group G is the algebra of all bounded operators onH=l (G) 
commuting with the action of G on H through right multiplication. One can show that this is the von Neumann 
algebra generated by the operators corresponding to multiplication from the left with an element g G G. It is a 
factor (of type 11^ if every non-trivial conjugacy class of G is infinite (for example, a non-abelian free group), 
and is the hyperfinite factor of type II if in addition G is a union of finite subgroups (for example, the group of all 
permutations of the integers fixing all but a finite number of elements). 

• The tensor product of two von Neumann algebras, or of a countable number with states, is a von Neumann 
algebra as described in the section above. 

• The crossed product of a von Neumann algebra by a discrete (or more generally locally compact) group can be 
defined, and is a von Neumann algebra. Special cases are the group-measure space construction of Murray and 
von Neumann and Krieger factors. 

• The von Neumann algebras of a measurable equivalence relation and a measurable groupoid can be defined. 
These examples generalise von Neumann group algebras and the group-measure space construction. 
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Applications 

Von Neumann algebras have found applications in diverse areas of mathematics like knot theory, statistical 
mechanics, Quantum field theory, Local quantum physics, Free probability, Noncommutative geometry, 
representation theory, geometry, and probability. 
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C*-algebra 

C*-algebras (pronounced "C-star") are an important area of research in functional analysis, a branch of 
mathematics. The prototypical example of a C*-algebra is a complex algebra A of linear operators on a complex 
Hilbert space with two additional properties: 

• A is a topologically closed set in the norm topology of operators. 

• A is closed under the operation of taking adjoints of operators. 

It is generally believed that C*-algebras were first considered primarily for their use in quantum mechanics to model 
algebras of physical observables. This line of research began with Werner Heisenberg's matrix mechanics and in a 
more mathematically developed form with Pascual Jordan around 1933. Subsequently John von Neumann attempted 
to establish a general framework for these algebras which culminated in a series of papers on rings of operators. 
These papers considered a special class of C*-algebras which are now known as von Neumann algebras. 

Around 1943, the work of Israel Gelfand and Mark Naimark yielded an abstract characterisation of C*-algebras 
making no reference to operators. 

C*-algebras are now an important tool in the theory of unitary representations of locally compact groups, and are 
also used in algebraic formulations of quantum mechanics. 

Abstract characterization 

We begin with the abstract characterization of C*-algebras given in the 1943 paper by Gelfand and Naimark. 

A C*-algebra, A, is a Banach algebra over the field of complex numbers, together with a map, * : A — » A, called an 
involution. The image of an element x of A under the involution is written x*. Involution has the following 
properties: 

• For all x, y in A: 

(x + yY = x* + y* 

(xy)* = y*x* 

• For every X in C and every x in A: 

(\x)* = Ax*. 

• For all x in A 

{x*y = x 

• The C*-identity holds for all x in A: 

||x*x|| = ||x||||x*||. 
Note that the C* identity is equivalent to: for all x in A: 

\\xx*\\ = ||x||||x*||. 

This relation is equivalent to ||xx*|| = ||x|| 2 , which is sometimes called the B*-identity. For history behind the 
names C*- and B*-algebras, see the history section below. 

The C* -identity is a very strong requirement. For instance, together with the spectral radius formula, it implies the 
C*-norm is uniquely determined by the algebraic structure: 

||x|| 2 = ||x*x|| = sup{|A| : x*x — A 1 is not invertible}. 
A bounded linear map, it : A — » B, between C*-algebras A and B is called a *-homomorphism if 

• For x and y in A 

7r(xy) = 7r(x)7r(y) 
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• For x in A 

<jr{x*) = 7t(x)* 

In the case of C*-algebras, any *-homomorphism it between C*-algebras is non-expansive, i.e. bounded with norm < 
1. Furthermore, an injective *-homomorphism between C*-algebras is isometric. These are consequences of the 
C* -identity. 

A bijective *-homomorphism jt is called a C* -isomorphism, in which case A and B are said to be isomorphic. 

Some history: B*-algebras and C*-algebras 

The term B*-algebra was introduced by C. E. Rickart in 1946 to describe Banach *-algebras that satisfy the 
condition 

2 

• IIjc jc*II = IIjcII for all x in the given B* -algebra. (B* -condition) 

This condition automatically implies that the ^-involution is isometric, that is, = IIjcII. Hence \\x x*ll = llxll ILx*ll, 
and therefore, a B* -algebra is a C* -algebra. Conversely, the C* -condition implies the B* -condition. This is 
nontrivial, and can be proved without using the condition llxll=llx*ll. (For details, see R. S. Doran, V. A. Belfi, 
Characterizations of C*- Algebras — the Gelfand-Naimark Theorems, CRC, 1986.) 

For these reasons, the term B*-algebra is rarely used in current terminology, and has been replaced by the term 'C* 
algebra'. 

The term C*-algebra was introduced by I. E. Segal in 1947 to describe norm-closed subalgebras of B(H), namely, 
the space of bounded operators on some Hilbert space H. 'C stood for 'closed'. 

Examples 

Finite-dimensional C* -algebras 

The algebra M n (C) of n-by-n matrices over C becomes a C* -algebra if we consider matrices as operators on the 
Euclidean space, C n , and use the operator norm 11.11 on matrices. The involution is given by the conjugate transpose. 
More generally, one can consider finite direct sums of matrix algebras. In fact, all finite dimensional C*-algebras are 
of this form. The self-adjoint requirement means finite-dimensional C*-algebras are semisimple, from which fact 
one can deduce the following theorem of Artin-Wedderburn type: 

Theorem. A finite-dimensional C*-algebra, A, is canonically isomorphic to a finite direct sum 
A= 0 Ae 

eEmin A 

where min A is the set of minimal nonzero self-adjoint central projections of A. 

Each C*-algebra, Ae, is isomorphic (in a noncanonical way) to the full matrix algebra M dim ( e )(Q- The finite family 
indexed on min A given by {dim(e)} e is called the dimension vector of A. This vector uniquely determines the 
isomorphism class of a finite-dimensional C*-algebra. 
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C* -algebras of operators 

The prototypical example of a C*-algebra is the algebra B(H) of bounded (equivalently continuous) linear operators 
defined on a complex Hilbert space H; here x* denotes the adjoint operator of the operator x : H — » H. In fact, every 
C*-algebra, A, is * -isomorphic to a norm-closed adjoint closed subalgebra of B(H) for a suitable Hilbert space, H\ 
this is the content of the Gelfand-Naimark theorem. 

Commutative C* -algebras 

Let X be a locally compact Hausdorff space. The space C Q (X) of complex- valued continuous functions on X that 
vanish at infinity (defined in the article on local compactness) form a commutative C*-algebra C (X) under 
pointwise multiplication and addition. The involution is pointwise conjugation. C Q (X) has a multiplicative unit 
element if and only if X is compact. As does any C*-algebra, C (X) has an approximate identity. In the case of C (X) 
this is immediate: consider the directed set of compact subsets of X, and for each compact K let f be a function of 
compact support which is identically 1 on K. Such functions exist by the Tietze extension theorem which applies to 
locally compact Hausdorff spaces, {flis an approximate identity. 

K K 

The Gelfand representation states that every commutative C*-algebra is ^-isomorphic to the algebra C Q (X), where X 
is the space of characters equipped with the weak* topology. Furthermore if C (X) is isomorphic to C (Y) as 
C*-algebras, it follows that X and Y are homeomorphic. This characterization is one of the motivations for the 
noncommutative topology and noncommutative geometry programs. 

C*-algebras of compact operators 

Let H be a separable infinite-dimensional Hilbert space. The algebra K(H) of compact operators on H is a norm 
closed subalgebra of B(H). It is also closed under involution; hence it is a C*-algebra. 

Concrete C*-algebras of compact operators admit a characterization similar to Wedderburn's theorem for finite 
dimensional C*-algebras. 

Theorem. If A is a C* -subalgebra of K(H), then there exists Hilbert spaces {H.} . e l such that A is isomorphic to the 
following direct sum 

iei 

where the (C*-)direct sum consists of elements (7\) of the Cartesian product n K(H) with 117.11 — » 0. 

Though K(H) does not have an identity element, a sequential approximate identity for K(H) can be easily displayed. 
To be specific, H is isomorphic to the space of square summable sequences / ; we may assume that 

H = e. 

2 

For each natural number n let be the subspace of sequences of / which vanish for indices 
k > n 

and let 

be the orthogonal projection onto H^. The sequence {e^t n is an approximate identity for K(H). 

K(H) is a two-sided closed ideal of B(H). For separable Hilbert spaces, it is the unique ideal. The quotient of B(H) by 
K(H) is the Calkin algebra. 
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C* -enveloping algebra 

Given a B*-algebra A with an approximate identity, there is a unique (up to C* -isomorphism) C*-algebra E(A) and 
*-morphism jt from A into E(A) which is universal, that is, every other B*-morphism jt ' : A — » 5 factors uniquely 
through jt. The algebra E(A) is called the C*-enveloping algebra of the B*-algebra A. 

Of particular importance is the C*-algebra of a locally compact group G. This is defined as the enveloping 
C*-algebra of the group algebra of G. The C*-algebra of G provides context for general harmonic analysis of G in 
the case G is non-abelian. In particular, the dual of a locally compact group is defined to be the primitive ideal space 
of the group C*-algebra. See spectrum of a C*-algebra. 

von Neumann algebras 

von Neumann algebras, known as W* algebras before the 1960s, are a special kind of C*-algebra. They are required 
to be closed in the weak operator topology, which is weaker than the norm topology. Their study is a specialized area 
of functional analysis. 

Properties of C*-algebras 

C*-algebras have a large number of properties that are technically convenient. These properties can be established by 
use the continuous functional calculus or by reduction to commutative C*-algebras. In the latter case, we can use the 
fact that the structure of these is completely determined by the Gelfand isomorphism. 

• The set of elements of a C*-algebra A of the form x*x forms a closed convex cone. This cone is identical to the 
elements of the form x x*. Elements of this cone are called non-negative (or sometimes positive, even though this 
terminology conflicts with its use for elements of R.) 

• The set of self-adjoint elements of a C*-algebra A naturally has the structure of a partially ordered vector space; 
the ordering is usually denoted >. In this ordering, a self-adjoint element x of A satisfies x > 0 if and only if the 
spectrum of x is non-negative. Two self-adjoint elements x and y of A satisfy x > y if x - y > 0. 

• Any C*-algebra A has an approximate identity. In fact, there is a directed family {e^}^ g j of self-adjoint elements 
of A such that 

xe\ — ► x 

0 < e x < < 1 whenever A < fi. 

In case A is separable, A has a sequential approximate identity. More generally, A will have a sequential 
approximate identity if and only if A contains a strictly positive element, i.e. a positive element h such that 
hAh is dense in A. 

• Using approximate identities, one can show that the algebraic quotient of a C* -algebra by a closed proper 
two-sided ideal, with the natural norm, is a C*-algebra. 

• Similarly, a closed two-sided ideal of a C*-algebra is itself a C*-algebra. 
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Type for C*-algebras 

A C*-algebra A is of type I if and only if for all non-degenerate representations jt of A the von Neumann algebra 
7t(A)" (that is, the bicommutant of Jt(A)) is a type I von Neumann algebra. In fact it is sufficient to consider only 
factor representations, i.e. representations jt for which jt(A)" is a factor. 

A locally compact group is said to be of type I if and only if its group C*-algebra is type I. 

However, if a C*-algebra has non-type I representations, then by results of James Glimm it also has representations 
of type II and type III. Thus for C*-algebras and locally compact groups, it is only meaningful to speak of type I and 
non type I properties. 

C*-algebras and quantum field theory 

In quantum field theory, one typically describes a physical system with a C* -algebra A with unit element; the 
self-adjoint elements of A (elements x with x* = x) are thought of as the observables, the measurable quantities, of 
the system. A state of the system is defined as a positive functional on A (a C-linear map cp : A — > C with cp(w* u) > 
0 for all wGA) such that (p(l) = 1. The expected value of the observable x, if the system is in state cp, is then cp(x). 

See Local quantum physics. 
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Kac-Moody algebra 

In mathematics, a Kac-Moody algebra (named for Victor Kac and Robert Moody, who independently discovered 
them) is a Lie algebra, usually infinite-dimensional, that can be defined by generators and relations through a 
generalized Cartan matrix. These algebras form a generalization of finite-dimensional semisimple Lie algebras, and 
many properties related to the structure of a Lie algebra such as its root system, irreducible representations, and 
connection to flag manifolds have natural analogues in the Kac-Moody setting. A class of Kac-Moody algebras 
called affine Lie algebras is of particular importance in mathematics and theoretical physics, especially conformal 
field theory and the theory of exactly solvable models. Kac discovered an elegant proof of certain combinatorial 
identities, Macdonald identities, which is based on the representation theory of affine Kac-Moody algebras. Garland 
and Lepowski demonstrated that Rogers-Ramanujan identities can be derived in a similar fashion. 



Definition 

A Kac-Moody algebra is given by the following: 

1. An nxn generalized Cartan matrix C = (c.) of rank r. 

2. A vector space {) over the complex numbers of dimension In - r. 

3. A set of n linearly independent elements on of {) and a set of n linearly independent elements a* of the dual 
space, such that = Qj . The on are known as coroots, while the a* are known as roots. 

The Kac-Moody algebra is the Lie algebra 0 defined by generators and fi and the elements of {) and relations 

[^ij fi\ 

• [ e U fj] = 0 f or * ^ 3 

• [e;, x] = a*(x)ei , for x £ \) 

' [fi,x] = -<**(*)/;, for x e f) 

• [x, x'] = Ofor x,x' G fj 

• ad(e i ) 1 -^(e j ) = 0 

• ad(/ i ) 1 -^(/ i ) = 0 

where ad : g — > End(fl), ad(x)(y) = [x, y]is the adjoint representation of 0. 

A real (possibly infinite-dimensional) Lie algebra is also considered a Kac-Moody algebra if its complexification is 
a Kac-Moody algebra. 



Interpretation 

f) is a Cartan subalgebra of the Kac-Moody algebra. 
If g is an element of the Kac-Moody algebra such that 
Vx€ f), [g,x] =u){x)g 

where oo is an element of {)* , then g is said to have weight oo. The Kac-Moody algebra can be diagonalized into 
weight eigenvectors. The Cartan subalgebra h has weight zero, e. has weight a* and/, has weight -a*.. If the Lie 
bracket of two weight eigenvectors is nonzero, then its weight is the sum of their weights. The condition 
[e^ 3 fj] = Ofor i ^ j simply means the a* are simple roots. 
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Types of Kac-Moody algebras 

Properties of a Kac-Moody algebra are controlled by the algebraic properties of its generalized Cartan matrix C. In 
order to classify Kac-Moody algebras, it is enough to consider the case of an indecomposable matrixC, that is, 
assume that there is no decomposition of the set of indices / into a disjoint union of non-empty subsets / and / such 
that C.j = 0 for all i in 1^ and j in 1^. Any decomposition of the generalized Cartan matrix leads to the direct sum 
decomposition of the corresponding Kac-Moody algebra: 

fl (C)~ fl(Ci)0 0(C 2 ), 

where the two Kac-Moody algebras in the right hand side are associated with the submatrices of C corresponding to 
the index sets 1^ and l^. 

An important subclass of Kac-Moody algebras corresponds to symmetrizable generalized Cartan matrices C, which 
can be decomposed as DS, where D is a diagonal matrix with positive integer entries and S is a symmetric matrix. 
Under the assumptions that C is symmetrizable and indecomposable, the Kac-Moody algebras are divided into three 
classes: 

• A positive definite matrix S gives rise to a finite-dimensional simple Lie algebra. 

• A positive semidefinite matrix S gives rise to an infinite-dimensional Kac-Moody algebra of affine type, or an 
affine Lie algebra. 

• An indefinite matrix S gives rise to a Kac-Moody algebra of indefinite type. 

• Since the diagonal entries of C and S are positive, S cannot be negative definite or negative semidefinite. 

Symmetrizable indecomposable generalized Cartan matrices of finite and affine type have been completely 
classified. They correspond to Dynkin diagrams and affine Dynkin diagrams. Very little is known about the 
Kac-Moody algebras of indefinite type. Among those, the main focus has been on the (generalized) Kac-Moody 
algebras of hyperbolic type, for which the matrix S is indefinite, but for each proper subset of /, the corresponding 
submatrix is positive definite or positive semidefinite. Such matrices have rank at most 10 and have also been 
completely determined. 
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Spectral theory 

In mathematics, spectral theory is an inclusive term for theories extending the eigenvector and eigenvalue theory of 
a single square matrix to a much broader theory of the structure of operators in a variety of mathematical spaces J ^ It 
is a result of studies of linear algebra and the solutions of systems of linear equations and their generalizations. The 
theory is connected to that of analytic functions because the spectral properties of an operator are related to analytic 
functions of the spectral parameter. 



Mathematical background 

The name spectral theory was introduced by David Hilbert in his original formulation of Hilbert space theory, which 
was cast in terms of quadratic forms in infinitely many variables. The original spectral theorem was therefore 
conceived as a version of the theorem on principal axes of an ellipsoid, in an infinite-dimensional setting. The later 
discovery in quantum mechanics that spectral theory could explain features of atomic spectra was therefore 
fortuitous. 

There have been three main ways to formulate spectral theory, all of which retain their usefulness. After Hilbert's 
initial formulation, the later development of abstract Hilbert space and the spectral theory of a single normal operator 
on it did very much go in parallel with the requirements of physics; particularly at the hands of von Neumann ^ The 
further theory built on this to include Banach algebras, which can be given abstractly. This development leads to the 
Gelfand representation, which covers the commutative case, and further into non-commutative harmonic analysis. 

The difference can be seen in making the connection with Fourier analysis. The Fourier transform on the real line is 
in one sense the spectral theory of differentiation qua differential operator. But for that to cover the phenomena one 
has already to deal with generalized eigenfunctions (for example, by means of a rigged Hilbert space). On the other 
hand it is simple to construct a group algebra, the spectrum of which captures the Fourier transform's basic 
properties, and this is carried out by means of Pontryagin duality. 

One can also study the spectral properties of operators on Banach spaces. For example, compact operators on Banach 
spaces have many spectral properties similar to that of matrices. 



Physical background 

The background in the physics of vibrations has been explained in this way:^ 

Spectral theory is connected with the investigation of localized vibrations of a variety of different objects, from atoms and molecules in 
chemistry to obstacles in acoustic waveguides. These vibrations have frequencies, and the issue is to decide when such localized vibrations 
occur, and how to go about computing the frequencies. This is a very complicated problem since every object has not only a fundamental tone 
but also a complicated series of overtones, which vary radically from one body to another. 

The mathematical theory is not dependent on such physical ideas on a technical level, but there are examples of 
mutual influence (see for example Mark Kac's question Can you hear the shape of a drum?). Hilbert's adoption of 
the term "spectrum" has been attributed to an 1897 paper of Wilhelm Wirtinger on Hill's equation (by Jean 
Dieudonne), and it was taken up by his students during the first decade of the twentieth century, among them Erhard 
Schmidt and Hermann Weyl. The conceptual basis for Hilbert space was developed from Hilbert's ideas by Erhard 
Schmidt and Frigyes Riesz. [6] [7] It was almost twenty years later, when quantum mechanics was formulated in terms 
of the Schrodinger equation, that the connection was made to atomic spectra; a connection with the mathematical 
physics of vibration had been suspected before, as remarked by Henri Poincare, but rejected for simple quantitative 
reasons, absent an explanation of the B aimer series. The later discovery in quantum mechanics that spectral theory 
could explain features of atomic spectra was therefore fortuitous, rather than being an object of Hilbert's spectral 
theory. 
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A definition of spectrum 

Consider a bounded linear transformation T defined everywhere over a general Banach space. We form the 
transformation: 

i? c = (C / - T)- 1 . 

Here / is the identity operator and E, is a complex number. The inverse of an operator T, that is T~\ is defined by: 

rp rp— 1 rji—1 rp j 

If the inverse exists, Tis called regular. If it does not exist, Tis called singular. 

With these definitions, the resolvent set of T is the set of all complex numbers E, such that R exists and is bounded. 
This set often is denoted as q(T). The spectrum of T is the set of all complex numbers C, such that R^ fails to exist or 
is unbounded. Often the spectrum of T is denoted by a(T). The function R for all t> m P(D (that is, wherever R 
exists) is called the resolvent of T. The spectrum of T is therefore the complement of the resolvent set of T in the 
complex plane P^ Every eigenvalue of T belongs to a(T), but a(T) may contain non-eigenvalues J 10] 

This definition applies to a Banach space, but of course other types of space exist as well, for example, topological 

vector spaces include Banach spaces, but can be more general J 1 ^ ^ On the other hand, Banach spaces include 

ri3i 

Hilbert spaces, and it is these spaces that find the greatest application and the richest theoretical results. With 
suitable restrictions, much can be said about the structure of the spectra of transformations in a Hilbert space. In 
particular, for self-adjoint operators, the spectrum lies on the real line and (in general) is a spectral combination of a 
point spectrum of discrete eigenvalues and a continuous spectrum P 4 ^ 

What is spectral theory, roughly speaking? 

In functional analysis and linear algebra the spectral theorem establishes conditions under which an operator can be 
expressed in simple form as a sum of simpler operators. As a full rigorous presentation is not appropriate for this 
article, we take an approach that avoids much of the rigor and satisfaction of a formal treatment with the aim of 
being more comprehensible to a non- specialist. 

This topic is easiest to describe by introducing the bra-ket notation of Dirac for operators/ 15 ^ ^ As an example, a 

ri7i risi 

very particular linear operator L might be written as a dyadic product: 

L=\k 1 ){b 1 \, 

in terms of the "bra" ( b and the "ket" k ) . A function /is described by a ket as / ) . The function fix) defined 
on the coordinates (x jy x^... ) is denoted as: 

/(*) = /) , 

and the magnitude of /by: 



(/, f)= Jdx {/, x)(x, f)= jdx r(x)f(x) , 



where the notation '*' denotes a complex conjugate. This inner product choice defines a very specific inner product 

ri3i 

space, restricting the generality of the arguments that follow. 
The effect of L upon a function /is then described as: 

L\f) = \k 1 ){b,\f) 

expressing the result that the effect of L on / is to produce a new function | k\ ) multiplied by the inner product 
represented by • 

A more general linear operator L might be expressed as: 

L = AildX/il + AaleaX/al + A 3 |e 3 ></ 3 | + . . . , 
where the { X. } are scalars and the { |e^) } are a basis and the { } a reciprocal basis for the space. The 

relation between the basis and the reciprocal basis is described, in part, by: 
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(fi\ e j) = S ij 

If such a formalism applies, the { } are eigenvalues of L and the functions { | e^) } are eigenfunctions of L. 

ri9i 

The eigenvalues are in the spectrum of L. 

Some natural questions are: under what circumstances does this formalism work, and for what operators L are 
expansions in series of other operators like this possible? Can any function / be expressed in terms of the 
eigenfunctions (are they a complete set) and under what circumstances does a point spectrum or a continuous 
spectrum arise? How do the formalisms for infinite dimensional spaces and finite dimensional spaces differ, or do 
they differ? Can these ideas be extended to a broader class of spaces? Answering such questions is the realm of 
spectral theory and requires considerable background in functional analysis and matrix algebra. 

Resolution of the identity 

This section continues in the rough and ready manner of the above section using the bra-ket notation, and glossing 
over the many important and fascinating details of a rigorous treatment P^ A rigorous mathematical treatment may 
be found in various references. 1 

Using the bra-ket notation of the above section, the identity operator may be written as: 

i=i 

where it is supposed as above that { \q) } are a basis and the { (f { \ } a reciprocal basis for the space satisfying 
the relation: 

{fi\ e j) 

This expression of the identity operation is called a representation or a resolution of the identity P^ This formal 
representation satisfies the basic property of the identity: 

I n = I 

valid for every positive integer n. 

Applying the resolution of the identity to any function in the space , one obtains: 



WHV>) = El e ;></#> = £*> 



i=l i=l 

[221 

which is the generalized Fourier expansion of ip in terms of the basis functions { e }. 
Given some operator equation of the form: 

o\+) = \h) 

with h in the space, this equation can be solved in the above basis through the formal manipulations: 



o|^> = E^(°l«*» = El c *>(/*l fc >. 



i=i i=i 

n ri 

(fjW) = 5></il°l*> = B/ilesX/ilfc) = {fj\h), Vj 

i=l 1=1 

which converts the operator equation to a matrix equation determining the unknown coefficients c in terms of the 
generalized Fourier coefficients (fj\h) ofh and the matrix elements Oji = (fj\0\e{) of the operator O. 

The role of spectral theory arises in establishing the nature and existence of the basis and the reciprocal basis. In 
particular, the basis might consist of the eigenfunctions of some linear operator L: 

L\ei) = \i\ei) ; 

with the { X. } the eigenvalues of L from the spectrum of L. Then the resolution of the identity above provides the 
dyad expansion of L: 
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LI = L = J2L\e i )(fi\ = E ^iHfil 
i=i i=i 

Resolvent operator 

Using spectral theory, the resolvent operator R: 

R=(X-L)-\ 

can be evaluated in terms of the eigenf unctions and eigenvalues of L, and the Green's function corresponding to L 
can be found. 

Applying R to some arbitrary function in the space, say cp, 

R\<p) = {\-L)-' \v) = Y»^ 1 ±^\e i ){f h <p). 

This function has poles in the complex A-plane at each eigenvalue of L. Thus, using the calculus of residues: 
1 



2m h d\{\ - L)~ l \<p) = -E? =1 \et) (f h <p) = 
where the line integral is over a contour C that includes all the eigenvalues of L. 
Suppose our functions are defined over some coordinates { x }, that is: 

<z, if) = <p(z u x 2 ,... ), 
where the bra-kets corresponding to { x } satisfy. 

(x, y) =6(x-y), 

and where 6 (x - y) = 6 (x ] - y Jt x 2 - x 3 - ...) is the Dirac delta function. 174 ^ 
Then: 

(*• h £ « A < A - L '" v ) = h £ iX <*■ < A - "> 

= h £ iX j dy {x ' (A " y) {y ' v) 

The function G(x, y; X) defined by: 

G(x, y; A) = (x, (A - L) _1 y) 

= E? =1 E? =1 <x, «)</,, (A - L)~ 1 ej)(fj, y) 
(x, ei)(fi, y) 



— ^1=1 

— ^1=1 



A-A; 
*(*)tf(v) 



A — Aj ' 

T251 

is called the Green's function for operator L, and satisfies: 



^- £ dA G(x, y; A) = -E? =1 (z, y) = -(x, y) = -S(x - y). 
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Operator equations 

Consider the operator equation: 

(O-XI) \4>) = \h); 

in terms of coordinates: 

J dy(x, (0-XI)y)(y, ^)=h{x). 

A particular case is X = 0. 

The Green's function of the previous section is: 

(y, G(X)z) = (y, (O - A/)" 1 *) = G(y, z; X) , 

and satisfies: 

J dy{x,(0- XI) y)(y, G(X)z) = f dy(x,(0- XI) y)(y, (O - A/)" 1 z) = (x, z) = S(x - z) . 
Using this Green's function property: 

J dy{x,(0- XI) y)G(y, z; A) = 5{x - z) . 
Then, multiplying both sides of this equation by h(z) and integrating: 

J dz h(z) j dy(x, (O- XI) y) G(y, z; A) = J dy (x, ( O - XI) y) J dz h(z) G(y, z; X) = h 

which suggests the solution is: 

V>0) = J dz h(z) G(x, z; A). 

That is, the function ip(x) satisfying the operator equation is found if we can find the spectrum of O, and construct G, 
for example by using: 

There are many other ways to find G, of course P 6 ^ See the articles on Green's functions and on Fredholm integral 
equations. It must be kept in mind that the above mathematics is purely formal, and a rigorous treatment involves 
some pretty sophisticated mathematics, including a good background knowledge of functional analysis, Hilbert 
spaces, distributions and so forth. Consult these articles and the references for more detail. 
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Quantum Field Theory, SUSY, Quantum 
Geometry and Quantum Algebraic Topology 

Quantum electrodynamics 

Quantum electrodynamics (QED) is the relativistic quantum field theory of electrodynamics. In essence, it 
describes how light and matter interact and is the first theory where full agreement between quantum mechanics and 
special relativity is achieved. QED mathematically describes all phenomena involving electrically charged particles 
interacting by means of exchange of photons and represents the quantum counterpart of classical electrodynamics 
giving a complete account of matter and light interaction. One of the founding fathers of QED, Richard Feynman, 
has called it "the jewel of physics" for its extremely accurate predictions of quantities like the anomalous magnetic 
moment of the electron, and the Lamb shift of the energy levels of hydrogen 

In technical terms, QED can be described as a perturbation theory of the electromagnetic quantum vacuum. 



History 

The first formulation of a quantum theory describing radiation and matter interaction is due to Paul Adrien Maurice 
Dirac, who, during 1920, was first able to compute the coefficient of spontaneous emission of an atom. 1 




Dirac described the quantization of the electromagnetic field as an ensemble of harmonic 
oscillators with the introduction of the concept of creation and annihilation operators of 
particles. In the following years, with contributions from Wolfgang Pauli, Eugene 
Wigner, Pascual Jordan, Werner Heisenberg and an elegant formulation of quantum 
electrodynamics due to Enrico Fermi, 1 physicists came to believe that, in principle, it 
would be possible to perform any computation for any physical process involving 
photons and charged particles. However, further studies by Felix Bloch with Arnold 
Nordsieck,^ and Victor Weisskopf,^ in 1937 and 1939, revealed that such 
computations were reliable only at a first order of perturbation theory, a problem already 
pointed out by Robert OppenheimerJ 6 ^ At higher orders in the series infinities emerged, 
making such computations meaningless and casting serious doubts on the internal consistency of the theory itself. 
With no solution for this problem known at the time, it appeared that a fundamental incompatibility existed between 
special relativity and quantum mechanics . 

Difficulties with the theory increased through the end of 1940. Improvements in 
microwave technology made it possible to take more precise measurements of the shift 
of the levels of a hydrogen atom, now known as the Lamb shift and magnetic moment 

T81 

of the electron. These experiments unequivocally exposed discrepancies which the 
theory was unable to explain. 

A first indication of a possible way out was given by Hans Bethe. In 1947, while he was 

rm 

traveling by train to reach Schenectady from New York, after giving a talk at the 
conference at Shelter Island on the subject, Bethe completed the first non-relativistic 
computation of the shift of the lines of the hydrogen atom as measured by Lamb and 

Hans Bethe 
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RetherfordJ 10 ^ Despite the limitations of the computation, agreement was excellent. The idea was simply to attach 
infinities to corrections at mass and charge that were actually fixed to a finite value by experiments. In this way, the 
infinities get absorbed in those constants and yield a finite result in good agreement with experiments. This 
procedure was named renormalization. 

Based on Bethe's intuition and fundamental 

papers on the subject by Sin-Itiro 

TomonagaJ 11 ^ Julian Schwinger/ 12 ^ ^ 

Richard Feynman^ 14 ^ ^ 15 ^ ^ and Freeman 

Dyson [17] [18] , it was finally possible to get 

fully covariant formulations that were finite 

at any order in a perturbation series of 

quantum electrodynamics. Sin-Itiro 

Tomonaga, Julian Schwinger and Richard 

Feynman were jointly awarded with a Nobel 

prize in physics in 1965 for their work in 

this areaJ 19 ^ Their contributions, and those 

of Freeman Dyson, were about covariant 

and gauge invariant formulations of 

quantum electrodynamics that allow 

computations of observables at any order of 

perturbation theory. Feynman' s 

mathematical technique, based on his 

diagrams, initially seemed very different 

from the field- theoretic, operator-based 

approach of Schwinger and Tomonaga, but 

Freeman Dyson later showed that the two 

ri7i 

approaches were equivalent. 

Renormalization, the need to attach a 
physical meaning at certain divergences 
appearing in the theory through integrals, 
has subsequently become one of the 
fundamental aspects of quantum field theory 

and has come to be seen as a criterion for a theory's general acceptability. Even though renormalization works very 
well in practice, Feynman was never entirely comfortable with its mathematical validity, even referring to 
renormalization as a "shell game" and "hocus pocus".^ 




Shelter Island Conference group photo (Courtesy of Archives, National Academy 

of Sciences). 




Feynman (center) and Oppenheimer (left) 
at Los Alamos. 



QED has served as the model and template for all subsequent quantum field theories. One such subsequent theory is 
quantum chromodynamics, which began in the early 1960s and attained its present form in the 1975 work by H. 
David Politzer, Sidney Coleman, David Gross and Frank Wilczek. Building on the pioneering work of Schwinger, 
Gerald Guralnik, Dick Hagen, and Tom Kibble, Peter Higgs, Jeffrey Goldstone, and others, Sheldon 

Glashow, Steven Weinberg and Abdus Salam independently showed how the weak nuclear force and quantum 
electrodynamics could be merged into a single electroweak force. 
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Feynman's view of quantum electrodynamics 



Introduction 

Near the end of his life, Richard P. Feynman gave a series of lectures on QED intended for the lay public. These 
lectures were transcribed and published as Feynman (1985), QED: The strange theory of light and matter} 1 ^ ^ a 
classic non-mathematical exposition of QED from the point of view articulated below. 

The key components of Feynman's presentation of QED are three basic actions. 

• A photon goes from one place and time to another place and time. 

• An electron goes from one place and time to another place and time. 

• An electron emits or absorbs a photon at a certain place and time. 

These actions are represented in a form of 
visual shorthand by the three basic elements 
of Feynman diagrams: a wavy line for the 
photon, a straight line for the electron and a 
junction of two straight lines and a wavy 
one for a vertex representing emission or 
absorption of a photon by an electron. These 
may all be seen in the adjacent diagram. 



Time 



B 



D 




A 

PHOTON 



ELECTRON 



PHOTON EMISSION OR ABSORPTION 



Space 



Feynman diagram elements 



It is important not to over-interpret these 
diagrams. Nothing is implied about how a 
particle gets from one point to another. The 
diagrams do not imply that the particles are 
moving in straight or curved lines. They do 
not imply that the particles are moving with 
fixed speeds. The fact that the photon is often represented, by convention, by a wavy line and not a straight one does 
not imply that it is thought that it is more wavelike than is an electron. The images are just symbols to represent the 
actions above: photons and electrons do, somehow, move from point to point and electrons, somehow, emit and 
absorb photons. We do not know how these things happen, but the theory tells us about the probabilities of these 
things happening. Trajectory is a meaningless concept in quantum mechanics. 

As well as the visual shorthand for the actions Feynman introduces another kind of shorthand for the numerical 
quantities which tell us about the probabilities. If a photon moves from one place and time - in shorthand, A - to 
another place and time — shorthand, B - the associated quantity is written in Feynman's shorthand as P(A to B). The 
similar quantity for an electron moving from C to D is written E(C to D). The quantity which tells us about the 
probability for the emission or absorption of a photon he calls 'j'. This is related to, but not the same as, the measured 
electron charge 'e'. 

QED is based on the assumption that complex interactions of many electrons and photons can be represented by 
fitting together a suitable collection of the above three building blocks, and then using the probability-quantities to 
calculate the probability of any such complex interaction. It turns out that the basic idea of QED can be 
communicated while making the assumption that the quantities mentioned above are just our everyday probabilities. 
(A simplification of Feynman's book.) Later on this will be corrected to include specifically quantum mathematics, 
following Feynman. 

The basic rules of probabilities that will be used are that a) if an event can happen in a variety of different ways then 
its probability is the sum of the probabilities of the possible ways and b) if a process involves a number of 
independent subprocesses then its probability is the product of the component probabilities. 
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Basic constructions 




Suppose we start with one electron at a certain place and time (this place and time being given the arbitrary label A) 
and a photon at another place and time (given the label B). A typical question from a physical standpoint is: 'What is 
the probability of finding an electron at C (another place and a later time) and a photon at D (yet another place and 
time)?'. The simplest process to achieve this end is for the electron to move from A to C (an elementary action) and 
that the photon moves from B to D (another elementary action). From a knowledge of the probabilities of each of 
these subprocesses - E(A to C) and P(B to D) - then we would expect to calculate the probability of both happening 
by multiplying them, using rule b) above. This gives a simple estimated answer to our question. 

But there are other ways in which the end result could come about. 
The electron might move to a place and time E where it absorbs 
the photon; then move on before emitting another photon at F; 
then move on to C where it is detected, while the new photon 
moves on to D. The probability of this complex process can again 
be calculated by knowing the probabilities of each of the 
individual actions: three electron actions, two photon actions and 
two vertexes - one emission and one absorption. We would expect 
to find the total probability by multiplying the probabilities of each 
of the actions, for any chosen positions of E and F. We then, using 
rule a) above, have to add up all these probabilities for all the 
alternatives for E and F. (This is not elementary in practice, and 
involves integration.) But there is another possibility: that is that the electron first moves to G where it emits a 
photon which goes on to D, while the electron moves on to H, where it absorbs the first photon, before moving on to 
C. Again we can calculate the probability of these possibilities (for all points G and H). We then have a better 
estimation for the total probability by adding the probabilities of these two possibilities to our original simple 
estimate. Incidentally the name given to this process of a photon interacting with an electron in this way is Compton 
Scattering. 



Compton scattering 



There are an infinite number of other intermediate processes in which more and more photons are absorbed and/or 
emitted. For each of these possibilities there is a Feynman diagram describing it. This implies a complex 
computation for the resulting probabilities, but provided it is the case that the more complicated the diagram the less 
it contributes to the result, it is only a matter of time and effort to find as accurate an answer as one wants to the 
original question. This is the basic approach of QED. To calculate the probability of any interactive process between 
electrons and photons it is a matter of first noting, with Feynman diagrams, all the possible ways in which the 
process can be constructed from the three basic elements. Each diagram involves some calculation involving definite 
rules to find the associated probability. 

That basic scaffolding remains when one moves to a quantum description but some conceptual changes are 
requested. One is that whereas we might expect in our everyday life that there would be some constraints on the 
points to which a particle can move, that is not true in full quantum electrodynamics. There is a certain possibility of 
an electron or photon at A moving as a basic action to any other place and time in the universe. That includes places 
that could only be reached at speeds greater than that of light and also earlier times. (An electron moving backwards 
in time can be viewed as a positron moving forward in time.) 
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Probability amplitudes 




Addition of probability amplitudes as complex 
numbers 



Quantum mechanics introduces an important change on the way 
probabilities are computed. It has been found that the quantities 
which we have to use to represent the probabilities are not the 
usual real numbers we use for probabilities in our everyday world, 
but complex numbers which are called probability amplitudes. 
Feynman avoids exposing the reader to the mathematics of 
complex numbers by using a simple but accurate representation of 
them as arrows on a piece of paper or screen. (These must not be 
confused with the arrows of Feynman diagrams which are actually 
simplified representations in two dimensions of a relationship 
between points in three dimensions of space and one of time.) The 

amplitude-arrows are fundamental to the description of the world given by quantum theory. No satisfactory reason 
has been given for why they are needed. But pragmatically we have to accept that they are an essential part of our 
description of all quantum phenomena. They are related to our everyday ideas of probability by the simple rule that 
the probability of an event is the square of the length of the corresponding amplitude-arrow. So, for a given process, 
if two probability amplitudes, v and w, are involved, the probability of the process will be given either by 

P = |v + w| 2 

or 

p = | v x w| 2 - 

The rules as regards adding or multiplying, however, are the same as above. But where you would expect to add or 
multiply probabilities, instead you add or multiply probability amplitudes that now are complex numbers. 

Addition and multiplication are familiar operations in the theory of 
complex numbers and are given in the figures. The sum is found as 
follows. Let the start of the second arrow be at the end of the first. 
The sum is then a third arrow that goes directly from the start of 
the first to the end of the second. The product of two arrows is an 
arrow whose length is the product of the two lengths. The 
direction of the product is found by adding the angles that each of 
the two have been turned through relative to a reference direction: 
that gives the angle that the product is turned relative to the 
reference direction. 

That change, from probabilities to probability amplitudes, 
complicates the mathematics without changing the basic approach. But that change is still not quite enough because 
it fails to take into account the fact that both photons and electrons can be polarized, which is to say that their 
orientation in space and time have to be taken into account. Therefore P(A to B) actually consists of 16 complex 
numbers, or probability amplitude arrows. There are also some minor changes to do with the quantity "j", which may 
have to be rotated by a multiple of 90° for some polarizations, which is only of interest for the detailed bookkeeping. 

Associated with the fact that the electron can be polarized is another small necessary detail which is connected with 
the fact that an electron is a Fermion and obeys Fermi-Dirac statistics. The basic rule is that if we have the 
probability amplitude for a given complex process involving more than one electron, then when we include (as we 
always must) the complementary Feynman diagram in which we just exchange two electron events, the resulting 
amplitude is the reverse — the negative — of the first. The simplest case would be two electrons starting at A and B 
ending at C and D. The amplitude would be calculated as the "difference", E(A to B)xE(C to D) — E(A to C)xE(B to 
D), where we would expect, from our everyday idea of probabilities, that it would be a sum. 
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Multiplication of probability amplitudes as complex 
numbers 
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Propagators 

Finally, one has to compute P(A to B) and E (C to D) corresponding to the probability amplitudes for the photon and 
the electron respectively. These are essentially the solutions of the Dirac Equation which describes the behavior of 
the electron's probability amplitude and the Klein-Gordon equation which describes the behavior of the photon's 
probability amplitude. These are called Feynman propagators. The translation to a notation commonly used in the 
standard literature is as follows: 

P(A to B) -> D F (x B - x A ), E(C to D) -> S F (x D - x c ) 
where a shorthand symbol such as Xj± stands for the four real numbers which give the time and position in three 
dimensions of the point labeled A. 



Mass renormalization 



A problem arose historically which held up progress for twenty 
years: although we start with the assumption of three basic 
"simple" actions, the rules of the game say that if we want to 
calculate the probability amplitude for an electron to get from A to 
B we must take into account all the possible ways: all possible 
Feynman diagrams with those end points. Thus there will be a way 
in which the electron travels to C, emits a photon there and then 
absorbs it again at D before moving on to B. Or it could do this 
kind of thing twice, or more. In short we have a fractal-like 
situation in which if we look closely at a line it breaks up into a 
collection of "simple" lines, each of which, if looked at closely, are 
in turn composed of "simple" lines, and so on ad infinitum. This is 
a very difficult situation to handle. If adding that detail only 
altered things slightly then it would not have been too bad, but 
disaster struck when it was found that the simple correction mentioned above led to infinite probability amplitudes. 
In time this problem was "fixed" by the technique of renormalization (see below and the article on mass 
renormalization). However, Feynman himself remained unhappy about it, calling it a "dippy process" P 0 ^ 




Electron self-energy loop 



Conclusions 

Within the above framework physicists were then able to calculate to a high degree of accuracy some of the 
properties of electrons, such as the anomalous magnetic dipole moment. However, as Feynman points out, it fails 
totally to explain why particles such as the electron have the masses they do. "There is no theory that adequately 
explains these numbers. We use the numbers in all our theories, but we don't understand them — what they are, or 
where they come from. I believe that from a fundamental point of view, this is a very interesting and serious 
problem. "^ 
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Mathematics 

Mathematically, QED is an abelian gauge theory with the symmetry group U(l). The gauge field, which mediates 
the interaction between the charged spin- 1/2 fields, is the electromagnetic field. The QED Lagrangian for a spin- 1/2 
field interacting with the electromagnetic field is given by the real part of 

c = v^7 m £>m - - ^ V . 

where 

7^ are Dirac matrices; 

ij; a bispinor field of spin- 1/2 particles (e.g. electron-positron field); 

= i/^7o> called "psi-bar", is sometimes referred to as Dirac adjoint; 
Dp — dp + icAp + ieB^is the gauge covariant derivative; 
e is the coupling constant, equal to the electric charge of the bispinor field; 

is the covariant four-potential of the electromagnetic field generated by the electron itself; 
Bp is the external field imposed by external source; 

— d^Av — d^A^is the electromagnetic field tensor. 

Equations of motion 

To begin, substituting the definition of D into the Lagrangian gives us: 

£ = ijrfW ~ ehM* + - m W> ~ iff***"- 

Next, we can substitute this Lagrangian into the Euler-Lagrange equation of motion for a field: 



to find the field equations for QED. 

The two terms from this Lagrangian are then: 

— = -efa^A* + Bfl ) - 

Substituting these two back into the Euler-Lagrange equation (2) results in: 

iByfrf + e^(A^ + B^)+m^ = 0 
with complex conjugate: 

i-fdyft - ej^A* 1 + B^ip - rmj> = 0. 
Bringing the middle term to the right-hand side transforms this second equation into: 



The left-hand side is like the original Dirac equation and the right-hand side is the interaction with the 
electromagnetic field. 

One further important equation can be found by substituting the Lagrangian into another Euler-Lagrange equation, 
this time for the field, J^ 1 : 
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The two terms this time are: 

dC 

— = -e+f* 
and these two terms, when substituted back into (3) give us: 



Interaction picture 

This theory can be straightforwardly quantized treating bosonic and fermionic sectors as free. This permits to build a 
set of asymptotic states to start a computation of the probability amplitudes for different processes. In order to be 
able to do so, we have to compute an evolution operator that, for a given initial state, will give a final state in such a 
way to have 

M fi = (f\U\i). 

This technique is also known as the S-Matrix. Evolution operator is obtained in the interaction picture where time 
evolution is given by the interaction Hamiltonian. So, from equations above is 

V = e J Sx^^A^ 

and so, one has 

U = Texp I--*- f dt f V{t') 
L h J to 

being T the time ordering operator. This evolution operator has only a meaning as a series and what we get here is a 
perturbation series with a development parameter being fine structure constant. This series is named Dyson series. 

Feynman diagrams 

Despite the conceptual clarity of this Feynman approach to QED, almost no textbooks follow him in their 
presentation. When performing calculations it is much easier to work with the Fourier transforms of the propagators. 
Quantum physics considers particle's momenta rather than their positions, and it is convenient to think of particles as 
being created or annihilated when they interact. Feynman diagrams then look the same, but the lines have different 
interpretations. The electron line represents an electron with a given energy and momentum, with a similar 
interpretation of the photon line. A vertex diagram represents the annihilation of one electron and the creation of 
another together with the absorption or creation of a photon, each having specified energies and momenta. 

Using Wick theorem on the terms of the Dyson series, all the terms of the S-matrix for quantum electrodynamics can 
be computed through the technique of Feynman diagrams. In this case rules for drawing are the following 
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a fr- /3 




P 2 + 2£ 



-ze7^(27r) 4 ^(p 1+ p 2 +p3)^ 



Incoming antifermion: a • -» s) 

Outgoing fermion: • ^ a -> u a (p,s) 

Outgoing antifermion: • a -> '^a(p>s) 

Incoming photon: ^ *wx^v« -> 6^ (ft, A) 

Outgoing photon: r\-^\-^ ft e^(fc, A)* 

To these rules we must add a further one for closed loops that implies an integration on momenta J cftp / (2tt)^ • 

From them, computations of probability amplitudes are straightforwardly given. An example is Compton scattering, 
with an electron and a photon undergoing elastic scattering. Feynman diagrams are in this case 
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and so we are able to get the corresponding amplitude at the first order of a perturbation series for S -matrix: 
M fi = (ie) 2 u(p',sW(k'A') 



( p + fc )2_ m 2 

from which we are able to compute the cross section for this scattering. 



(p — k f ) 2 — m\ 



Renormalizability 

Higher order terms can be straightforwardly computed for the evolution operator but these terms display diagrams 
containing the following simpler ones 




One-loop contribution to the 
vacuum polarization function 

n 



One-loop contribution to the electron 
self-energy function Yj 




One-loop 
contribution 
to the vertex 
function P 



that, being closed loops, imply the presence of diverging integrals having no mathematical meaning. To overcome 
this difficulty, a technique like renormalization has been devised, producing finite results in very close agreement 
with experiments. It is important to note that a criterion for theory being meaningful after renormalization is that the 
number of diverging diagrams is finite. In this case the theory is said renormalizable. The reason for this is that to 
get observables renormalized one needs a finite number of constants to maintain the predictive value of the theory 
untouched. This is exactly the case of quantum electrodynamics displaying just three diverging diagrams. This 
procedure gives observables in very close agreement with experiment as seen e.g. for electron gyromagnetic ratio. 

Renormalizability has become an essential criterion for a quantum field theory to be considered as a viable one. All 
the theories describing fundamental interactions, except gravitation whose quantum counterpart is presently under 
very active research, are renormalizable theories. 
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Nonconvergence of series 

An argument by Freeman Dyson shows that the radius of convergence of the perturbation series in QED is zeroJ 24 ^ 
The basic argument goes as follows: if the coupling constant were negative, this would be equivalent to the Coulomb 
force constant being negative. This would "reverse" the electromagnetic interaction so that like charges would attract 
and unlike charges would repel. This would render the vacuum unstable against decay into a cluster of electrons on 
one side of the universe and a cluster of positrons on the other side of the universe. Because the theory is sick for any 
negative value of the coupling constant, the series do not converge, but are an asymptotic series. This can be taken as 
a need for a new theory, a problem with perturbation theory, or ignored by taking a "shut-up-and-calculate" 
approach. 
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Quantum field theory 

Quantum field theory (QFT)^ provides a theoretical framework for constructing quantum mechanical models of 
systems classically parametrized (represented) by an infinite number of dynamical degrees of freedom, that is, fields 
and (in a condensed matter context) many-body systems. It is the natural and quantitative language of particle 
physics and condensed matter physics. Most theories in modern particle physics, including the Standard Model of 
elementary particles and their interactions, are formulated as relativistic quantum field theories. Quantum field 
theories are used in many contexts, elementary particle physics being the most vital example, where the particle 
count/number going into a reaction fluctuates and changes, differing from the count/number going out, for example, 
and for the description of critical phenomena and quantum phase transitions, such as in the BCS theory of 
superconductivity, also see phase transition, quantum phase transition, critical phenomena. Quantum field theory is 
thought by many to be the unique and correct outcome of combining the rules of quantum mechanics with special 
relativity. 

In perturbative quantum field theory, the forces between particles are mediated by other particles. The 
electromagnetic force between two electrons is caused by an exchange of photons. Intermediate vector bosons 
mediate the weak force and gluons mediate the strong force. There is currently no complete quantum theory of the 
remaining fundamental force, gravity, but many of the proposed theories postulate the existence of a graviton 
particle that mediates it. These force-carrying particles are virtual particles and, by definition, cannot be detected 
while carrying the force, because such detection will imply that the force is not being carried. In addition, the notion 
of "force mediating particle" comes from perturbation theory, and thus does not make sense in a context of bound 
states. 

In QFT photons are not thought of as 'little billiard balls', they are considered to be field quanta - necessarily 
chunked ripples in a field, or "excitations," that 'look like' particles. Fermions, like the electron, can also be described 
as ripples/excitations in a field, where each kind of fermion has its own field. In summary, the classical visualisation 
of "everything is particles and fields," in quantum field theory, resolves into "everything is particles," which then 
resolves into "everything is fields." In the end, particles are regarded as excited states of a field (field quanta). The 
gravitational field and the electromagnetic field are the only two fundamental fields in Nature that have infinite range 
and a corresponding classical low-energy limit, which greatly diminishes and hides their "particle-like" excitations. 
Albert Einstein, in 1905, attributed "particle-like" and discrete exchanges of momenta and energy, characteristic of 
"field quanta," to the electromagnetic field. Originally, his principal motivation was to explain the thermodynamics 
of radiation. Although it is often claimed that the photoelectric and Compton effects require a quantum description of 
the EM field, this is now understood to be untrue, and proper proof of the quantum nature of radiation is now taken 
up into modern quantum optics as in the antibunching effect . The word "photon" was coined in 1926 by the great 
physical chemist Gilbert Newton Lewis (see also the articles photon antibunching and laser). 

The "low-energy limit" of the correct quantum field-theoretic description of the electromagnetic field, quantum 
electrodynamics, is believed to become James Clerk Maxwell's 1864 theory, although the "classical limit" of 
quantum electrodynamics has not been as widely explored as that of quantum mechanics. Presumably, the as yet 
unknown correct quantum field-theoretic treatment of the gravitational field will become and "look exactly like" 
Einstein's general theory of relativity in the "low-energy limit." Indeed, quantum field theory itself is quite possibly 
the low-energy-effective-field-theory limit of a more fundamental theory such as superstring theory. Compare in this 
context the article effective field theory. 
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History 

Quantum field theory originated in the 1920s from the problem of creating a quantum mechanical theory of the 
electromagnetic field. In 1925 Werner Heisenberg, Max Born, and Pascual Jordan constructed such a theory by 
expressing the field's internal degrees of freedom as an infinite set of harmonic oscillators and by employing the 
usual procedure for quantizing those oscillators. This theory assumed that no electric charges or currents were 
present, and today would be called a free field theory. The first reasonably complete theory of quantum 
electrodynamics, which included both the electromagnetic field and electrically charged matter (specifically, 
electrons) as quantum mechanical objects, was created by Paul Dirac in 1927. This quantum field theory could be 
used to model important processes such as the emission of a photon by an electron dropping into a quantum state of 
lower energy, a process in which the number of particles changes — one atom in the initial state becomes an atom 
plus a photon in the final state. It is now understood that the ability to describe such processes is one of the most 
important features of quantum field theory. 

It was evident from the beginning that a proper quantum treatment of the electromagnetic field had to somehow 
incorporate Einstein's relativity theory, which had after all grown out of the study of classical electromagnetism. This 
need to put together relativity and quantum mechanics was the second major motivation in the development of 
quantum field theory. Pascual Jordan and Wolfgang Pauli showed in 1928 that quantum fields could be made to 
behave in the way predicted by special relativity during coordinate transformations (specifically, they showed that 
the field commutators were Lorentz invariant). A further boost for quantum field theory came with the discovery of 
the Dirac equation, which was originally formulated and interpreted as a single-particle equation analogous to the 
Schrodinger equation, but unlike the Schrodinger equation, the Dirac equation satisfies both Lorentz invariance, that 
is, the requirements of special relativity, and the rules of quantum mechanics. The Dirac equation accommodated the 
spin- 1/2 value of the electron and accounted for its magnetic moment as well as giving accurate predictions for the 
spectra of hydrogen. The attempted interpretation of the Dirac equation as a single-particle equation could not be 
maintained long, however, and finally it was shown that several of its undesirable properties (such as 
negative-energy states) could be made sense of by reformulating and reinterpreting the Dirac equation as a true field 
equation, in this case for the quantized "Dirac field" or the "electron field", with the "negative-energy solutions" 
pointing to the existence of anti -particles. This work was performed first by Dirac himself with the invention of hole 
theory 1930 and also by Wendell Furry, Robert Oppenheimer, Vladimir Fock, and others. Schrodinger, during the 
same period that he discovered his famous equation in 1926, also independently found the relativistic generalization 
of it known as the Klein-Gordon equation but dismissed it since, without spin, it predicted impossible properties for 
the hydrogen spectrum. See Oskar Klein, Walter Gordon. All relativistic wave equations that describe spin-zero 
particles are said to be of the Klein-Gordon type. 

A subtle and careful analysis in 1933 and later in 1950 by Niels Bohr and Leon Rosenfeld showed that there is a 
fundamental limitation on the ability to simultaneously measure the electric and magnetic field strengths that enter 
into the description of charges in interaction with radiation, imposed by the uncertainty principle, which must apply 
to all canonically conjugate quantities. This limitation is crucial for the successful formulation and interpretation of a 
quantum field theory of photons and electrons(quantum electrodynamics), and indeed,any perturbative quantum field 
theory. The analysis of Bohr and Rosenfeld explains fluctuations in the values of the electromagnetic field that differ 
from the classically "allowed" values distant from the sources of the field. Their analysis was crucial to showing that 
the limitations and physical implications of the uncertainty principle apply to all dynamical systems, whether fields 
or material particles. Their analysis also convinced most people that any notion of returning to a fundamental 
description of nature based on classical field theory, such as what Einstein aimed at with his numerous and failed 
attempts at a classical unified field theory, was simply out of the question. 

The third thread in the development of quantum field theory was the need to handle the statistics of many-particle 
systems consistently and with ease. In 1927 Jordan tried to extend the canonical quantization of fields to the 
many-body wave functions of identical particles, a procedure that is sometimes called second quantization. In 1928, 
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Jordan and Eugene Wigner found that the quantum field describing electrons, or other fermions, had to be expanded 
using anti-commuting creation and annihilation operators due to the Pauli exclusion principle. This thread of 
development was incorporated into many-body theory and strongly influenced condensed matter physics and nuclear 
physics. 

Despite its early successes quantum field theory was plagued by several serious theoretical difficulties. Basic 
physical quantities, such as the self-energy of the electron, the energy shift of electron states due to the presence of 
the electromagnetic field, gave infinite, divergent contributions — a nonsensical result — when computed using the 
perturbative techniques available in the 1930s and most of the 1940s. The electron self-energy problem was already 
a serious issue in the classical electromagnetic field theory, where the attempt to attribute to the electron a finite size 
or extent (the classical electron-radius) led immediately to the question of what non-electromagnetic stresses would 
need to be invoked, which would presumably hold the electron together against the Coulomb repulsion of its 
finite-sized "parts". The situation was dire, and had certain features that reminded many of the "Ray leigh- Jeans 
difficulty". What made the situation in the 1940s so desperate and gloomy, however, was the fact that the correct 
ingredients (the second-quantized Maxwell-Dirac field equations) for the theoretical description of interacting 
photons and electrons were well in place, and no major conceptual change was needed analogous to that which was 
necessitated by a finite and physically sensible account of the radiative behavior of hot objects, as provided by the 
Planck radiation law. 

This "divergence problem" was solved in the case of quantum electrodynmaics during the late 1940s and early 1950s 
by Hans Bethe, Tomonaga, Schwinger, Feynman, and Dyson, through the procedure known as renormalization. 
Great progress was made after realizing that ALL infinities in quantum electrodynamics are related to two effects: 
the self-energy of the electron/positron and vacuum polarization. Renormalization concerns the business of paying 
very careful attention to just what is meant by, for example, the very concepts "charge" and "mass" as they occur in 
the pure, non-interacting field-equations. The "vacuum" is itself polarizable and, hence, populated by virtual particle 
(on shell and off shell) pairs, and, hence, is a seething and busy dynamical system in its own right. This was a critical 
step in identifying the source of "infinities" and "divergences". The "bare mass" and the "bare charge" of a particle, 
the values that appear in the free-field equations (non-interacting case), are abstractions that are simply not realized 
in experiment (in interaction). What we measure, and hence, what we must take account of with our equations, and 
what the solutions must account for, are the "renormalized mass" and the "renormalized charge" of a particle. That is 
to say, the "shifted" or "dressed" values these quantities must have when due care is taken to include all deviations 
from their "bare values" is dictated by the very nature of quantum fields themselves. 

The first approach that bore fruit is known as the "interaction representation," (see the article Interaction picture) a 
Lorentz covariant and gauge-invariant generalization of time-dependent perturbation theory used in ordinary 
quantum mechanics, and developed by Tomonaga and Schwinger, generalizing earlier efforts of Dirac, Fock and 
Podolsky. Tomonaga and Schwinger invented a relativistically covariant scheme for representing field commutators 
and field operators intermediate between the two main representations of a quantum system, the Schrodinger and the 
Heisenberg representations (see the article on quantum mechanics). Within this scheme, field commutators at 
separated points can be evaluated in terms of "bare" field creation and annihilation operators. This allows for keeping 
track of the time-evolution of both the "bare" and "renormalized", or perturbed, values of the Hamiltonian and 
expresses everything in terms of the coupled, gauge invariant "bare" field-equations. Schwinger gave the most 
elegant formulation of this approach. The next and most famous development is due to Feynman, who, with his 
brilliant rules for assigning a "graph"/" diagram" to the terms in the scattering matrix (See S-Matrix Feynman 
diagrams). These directly corresponded (through the Schwinger-Dyson equation) to the measurable physical 
processes (cross sections, probability amplitudes, decay widths and lifetimes of excited states) one needs to be able 
to calculate. This revolutionized how quantum field theory calculations are carried-out in practice. 

Two classic text-books from the 1960s, J.D. Bjorken and S.D. Drell, Relativistic Quantum Mechanics (1964) and J.J. 
Sakurai, Advanced Quantum Mechanics (1967), thoroughly developed the Feynman graph expansion techniques 
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using physically intuitive and practical methods following from the correspondence principle, without worrying 
about the technicalities involved in deriving the Feynman rules from the superstructure of quantum field theory 
itself. Although both Feynman' s heuristic and pictorial style of dealing with the infinities, as well as the formal 
methods of Tomonaga and Schwinger, worked extremely well, and gave spectacularly accurate answers, the true 
analytical nature of the question of "renormalizability", that is, whether ANY theory formulated as a "quantum field 
theory" would give finite answers, was not worked-out till much later, when the urgency of trying to formulate finite 
theories for the strong and electro-weak (and gravitational interactions) demanded its solution. 

Renormalization in the case of QED was largely fortuitous due to the smallness of the coupling constant, the fact that 
the coupling has no dimensions involving mass, the so-called fine structure constant, and also the zero-mass of the 
gauge boson involved, the photon, rendered the small-distance/high-energy behavior of QED manageable. Also, 
electromagnetic processess are very "clean" in the sense that they are not badly suppressed/damped and/or hidden by 
the other gauge interactions. By 1958 Sidney Drell observed: "Quantum electrodynamics (QED) has achieved a 
status of peaceful coexistence with its divergences...." 

The unification of the electromagnetic force with the weak force encountered initial difficulties due to the lack of 
accelerator energies high enough to reveal processes beyond the Fermi interaction range. Additionally, a satisfactory 
theoretical understanding of hadron substructure had to be developed, culminating in the quark model. 

In the case of the strong interactions, progress concerning their short-distance/high-energy behavior was much 
slower and more frustrating. For strong interactions with the electro- weak fields, there were difficult issues regarding 
the strength of coupling, the mass generation of the force carriers as well as their non-linear, self interactions. 
Although there has been theoretical progress toward a grand unified quantum field theory incorporating the 
electro-magnetic force, the weak force and the strong force, empirical verification is still pending. Superunification, 
incorporating the gravitational force, is still very speculative, and is under intensive investigation by many of the best 
minds in contemporary theoretical physics. Gravitation is a tensor field description of a spin-2 gauge-boson, the 
"graviton", and is further discussed in the articles on general relativity and quantum gravity. 

From the point of view of the techniques of (four-dimensional) quantum field theory, and as the numerous and heroic 
efforts to formulate a consistent quantum gravity theory by some very able minds attests, gravitational quantization 
was, and is still, the reigning champion for bad behavior. There are problems and frustrations stemming from the fact 
that the gravitational coupling constant has dimensions involving inverse powers of mass, and as a simple 
consequence, it is plagued by badly behaved (in the sense of perturbation theory) non-linear and violent 
self-interactions. Gravity, basically, gravitates, which in turn... gravitates... and so on, (i.e., gravity is itself a source of 
gravity,...,) thus creating a nightmare at all orders of perturbation theory. Also, gravity couples to all energy equally 
strongly, as per the equivalence principle, so this makes the notion of ever really "switching-off", "cutting-off" or 
separating, the gravitational interaction from other interactions ambiguous and impossible since, with gravitation, we 
are dealing with the very structure of space-time itself. (See general covariance and, for a modest, yet highly 
non-trivial and significant interplay between (QFT) and gravitation (spacetime), see the article Hawking radiation 
and references cited therein. Also quantum field theory in curved spacetime). 

Thanks to the somewhat brute-force, clanky and heuristic methods of Feynman, and the elegant and abstract methods 
of Tomonaga/Schwinger, from the period of early renormalization, we do have the modern theory of quantum 
electrodynamics (QED). It is still the most accurate physical theory known, the prototype of a successful quantum 
field theory. Beginning in the 1950s with the work of Yang and Mills, as well as Ryoyu Utiyama, following the 
previous lead of Weyl and Pauli, deep explorations illuminated the types of symmetries and in variances any field 
theory must satisfy. QED, and indeed, all field theories, were generalized to a class of quantum field theories known 
as gauge theories. Quantum electrodynamics is the most famous example of what is known as an Abelian gauge 
theory. It relies on the symmetry group U(l) and has one massless gauge field, the U(l) gauge symmetry, dictating 
the form of the interactions involving the electromagnetic field, with the photon being the gauge boson. That 
symmetries dictate, limit and necessitate the form of interaction between particles is the essence of the "gauge theory 
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revolution." Yang and Mills formulated the first explicit example of a non-Abelian gauge theory, Yang-Mills theory, 
with an attempted explanation of the strong interactions in mind. The strong interactions were then (incorrectly) 
understood in the mid-1950s, to be mediated by the pi-mesons, the particles predicted by Hideki Yukawa in 1935, 
based on his profound reflections concerning the reciprocal connection between the mass of any force-mediating 
particle and the range of the force it mediates. This was allowed by the uncertainty principle. The 1960s and 1970s 
saw the formulation of a gauge theory now known as the Standard Model of particle physics, which systematically 
describes the elementary particles and the interactions between them. 

The electroweak interaction part of the standard model was formulated by Sheldon Glashow in the years 1958-60 
with his discovery of the SU(2)xU(l) group structure of the theory. Steven Weinberg and Abdus Salam brilliantly 
invoked the Anderson-Higgs mechanism for the generation of the W's and Z masses (the intermediate vector 
boson(s) responsible for the weak interactions and neutral-currents) and keeping the mass of the photon zero. The 
Goldstone/Higgs idea for generating mass in gauge theories was sparked in the late 1950s and early 1960s when a 
number of theoreticians (including Yoichiro Nambu, Steven Weinberg, Jeffrey Goldstone, Frangois Englert, Robert 
Brout, G. S. Guralnik, C. R. Hagen, Tom Kibble and Philip Warren Anderson) noticed a possibly useful analogy to 
the (spontaneous) breaking of the U(l) symmetry of electromagnetism in the formation of the BCS ground-state of a 
superconductor. The gauge boson involved in this situation, the photon, behaves as though it has acquired a finite 
mass. There is a further possibility that the physical vacuum (ground- state) does not respect the symmetries implied 
by the "unbroken" electroweak Lagrangian (see the article Electroweak interaction for more details) from which one 
arrives at the field equations. The electroweak theory of Weinberg and Salam was shown to be renormalizable 
(finite) and hence consistent by Gerardus 't Hooft and Martinus Veltman. The Glashow- Weinberg-Salam theory 
(GWS-Theory) is a triumph and, in certain applications, gives an accuracy on a par with quantum electrodynamics. 

Also during the 1970s parallel developments in the study of phase transitions in condensed matter physics led Leo 
Kadanoff, Michael Fisher and Kenneth Wilson (extending work of Ernst Stueckelberg, Andre Peterman, Murray 
Gell-Mann, and Francis Low) to a set of ideas and methods known as the renormalization group. By providing a 
better physical understanding of the renormalization procedure invented in the 1940s, the renormalization group 
sparked what has been called the "grand synthesis" of theoretical physics, uniting the quantum field theoretical 
techniques used in particle physics and condensed matter physics into a single theoretical framework. 

The study of quantum field theory is alive and flourishing, as are applications of this method to many physical 
problems. It remains one of the most vital areas of theoretical physics today, providing a common language to many 
branches of physics. 

Principles of quantum field theory 
Classical fields and quantum fields 

Quantum mechanics, in its most general formulation, is a theory of abstract operators (observables) acting on an 
abstract state space (Hilbert space), where the observables represent physically observable quantities and the state 
space represents the possible states of the system under study. Furthermore, each observable corresponds, in a 
technical sense, to the classical idea of a degree of freedom. For instance, the fundamental observables associated 
with the motion of a single quantum mechanical particle are the position and momentum operators x an d p . 
Ordinary quantum mechanics deals with systems such as this, which possess a small set of degrees of freedom. 
(It is important to note, at this point, that this article does not use the word "particle" in the context of wave-particle 
duality. In quantum field theory, "particle" is a generic term for any discrete quantum mechanical entity, such as an 
electron or photon, which can behave like classical particles or classical waves under different experimental 
conditions.) 

A quantum field is a quantum mechanical system containing a large, and possibly infinite, number of degrees of 
freedom. A classical field contains a set of degrees of freedom at each point of space; for instance, the classical 
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electromagnetic field defines two vectors — the electric field and the magnetic field — that can in principle take on 
distinct values for each position r . When the field as a whole is considered as a quantum mechanical system, its 
observables form an infinite (in fact uncountable) set, because r is continuous. 

Furthermore, the degrees of freedom in a quantum field are arranged in "repeated" sets. For example, the degrees of 
freedom in an electromagnetic field can be grouped according to the position r , with exactly two vectors for each 
r . Note that r is an ordinary number that "indexes" the observables; it is not to be confused with the position 
operator x encountered in ordinary quantum mechanics, which is an observable. (Thus, ordinary quantum 
mechanics is sometimes referred to as "zero-dimensional quantum field theory", because it contains only a single set 
of observables.) 

It is also important to note that there is nothing special about r because, as it turns out, there is generally more than 
one way of indexing the degrees of freedom in the field. 

In the following sections, we will show how these ideas can be used to construct a quantum mechanical theory with 
the desired properties. We will begin by discussing single-particle quantum mechanics and the associated theory of 
many-particle quantum mechanics. Then, by finding a way to index the degrees of freedom in the many-particle 
problem, we will construct a quantum field and study its implications. 

Single-particle and many-particle quantum mechanics 

In quantum mechanics, the time-dependent Schrodinger equation for a single particle is 



We wish to consider how this problem generalizes to 7VP ar tid es - There are two motivations for studying the 

many-particle problem. The first is a straightforward need in condensed matter physics, where typically the number 

23 

of particles is on the order of Avogadro's number (6.0221415 x 10 ). The second motivation for the many-particle 
problem arises from particle physics and the desire to incorporate the effects of special relativity. If one attempts to 
include the relativistic rest energy into the above equation (in quantum mechanics where position is an observable), 
the result is either the Klein-Gordon equation or the Dirac equation. However, these equations have many 
unsatisfactory qualities; for instance, they possess energy eigenvalues that extend to — °o, so that there seems to be no 
easy definition of a ground state. It turns out that such inconsistencies arise from relativistic wavefunctions having a 
probabilistic interpretation in position space, as probability conservation is not a relativistically covariant concept. In 
quantum field theory, unlike in quantum mechanics, position is not an observable, and thus, one does not need the 
concept of a position-space probability density. For quantum fields whose interaction can be treated perturbatively, 
this is equivalent to neglecting the possibility of dynamically creating or destroying particles, which is a crucial 
aspect of relativistic quantum theory. Einstein's famous mass-energy relation allows for the possibility that 
sufficiently massive particles can decay into several lighter particles, and sufficiently energetic particles can combine 
to form massive particles. For example, an electron and a positron can annihilate each other to create photons. This 
suggests that a consistent relativistic quantum theory should be able to describe many -particle dynamics. 

Furthermore, we will assume that the TV particles are indistinguishable. As described in the article on identical 
particles, this implies that the state of the entire system must be either symmetric (bosons) or antisymmetric 
(fermions) when the coordinates of its constituent particles are exchanged. These multi-particle states are rather 
complicated to write. For example, the general quantum state of a system of TV bosons is written as 



over all possible permutations p acting on elements. In general, this is a sum of 7V!( TV factorial) distinct 
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where are the single-particle states, Njis the number of particles occupying state j , and the sum is taken 
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terms, which quickly becomes unmanageable as TV mcre ases. The way to simplify this problem is to turn it into a 
quantum field theory. 

Second quantization 

In this section, we will describe a method for constructing a quantum field theory called second quantization. This 

basically involves choosing a way to index the quantum mechanical degrees of freedom in the space of multiple 

identical-particle states. It is based on the Hamiltonian formulation of quantum mechanics; several other approaches 

Mi 

exist, such as the Feynman path integral, which uses a Lagrangian formulation. For an overview, see the article on 
quantization. 

Second quantization of bosons 

For simplicity, we will first discuss second quantization for bosons, which form perfectly symmetric quantum states. 
Let us denote the mutually orthogonal single-particle states by |02)j 1 03 ) , and so on. For example, the 

3-particle state with one particle in state and two in state |0 2 ) * s 

^ [\M<h)\<h) + |0 2 >|0i>|0 2 > + \<h)\<h)\<f>i)} ■ 

The first step in second quantization is to express such quantum states in terms of occupation numbers, by listing 
the number of particles occupying each of the single-particle states |02} 5 etc - This is simply another way of 

labelling the states. For instance, the above 3-particle state is denoted as 
|1,2,0,0,0,...). 

The next step is to expand the TV -particle state space to include the state spaces for all possible values of TV • This 
extended state space, known as a Fock space, is composed of the state space of a system with no particles (the 
so-called vacuum state), plus the state space of a 1 -particle system, plus the state space of a 2-particle system, and so 
forth. It is easy to see that there is a one-to-one correspondence between the occupation number representation and 
valid boson states in the Fock space. 

At this point, the quantum mechanical system has become a quantum field in the sense we described above. The 
field's elementary degrees of freedom are the occupation numbers, and each occupation number is indexed by a 
number j • • • , indicating which of the single-particle states |0i) , |02) j ■ ■ ■ " " " ^ refers to. 
The properties of this quantum field can be explored by defining creation and annihilation operators, which add and 
subtract particles. They are analogous to "ladder operators" in the quantum harmonic oscillator problem, which 
added and subtracted energy quanta. However, these operators literally create and annihilate particles of a given 
quantum state. The bosonic annihilation operator <22and creation operator a\ nave me following effects: 

oalJVi, N 2 , N 3 ,...) = Jn~ 2 \ N u (N 2 - 1), N 3 , . . .}, 
4\N U N 2 , N 3 , ...) = yfN 2 + l | N u (N 2 + 1), N 3 , . . .). 

It can be shown that these are operators in the usual quantum mechanical sense, i.e. linear operators acting on the 
Fock space. Furthermore, they are indeed Hermitian conjugates, which justifies the way we have written them. They 
can be shown to obey the commutation relation 

[a h a 5 ] = 0 , [aj, a]] = 0 , [a*, a]] = S ij: 

where S stands for the Kronecker delta. These are precisely the relations obeyed by the ladder operators for an 
infinite set of independent quantum harmonic oscillators, one for each single-particle state. Adding or removing 
bosons from each state is therefore analogous to exciting or de-exciting a quantum of energy in a harmonic 
oscillator. 

The Hamiltonian of the quantum field (which, through the Schrodinger equation, determines its dynamics) can be 
written in terms of creation and annihilation operators. For instance, the Hamiltonian of a field of free 
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(non-interacting) bosons is 
k 

where E k is the energy of the -th single-particle energy eigenstate. Note that 

ata k \...,N k) ...) = N k \...,N k ,...). 
Second quantization of fermions 

It turns out that a different definition of creation and annihilation must be used for describing fermions. According to 
the Pauli exclusion principle, fermions cannot share quantum states, so their occupation numbers N{ can only take 
on the value 0 or 1. The fermionic annihilation operators c and creation operators c t are defined by their actions on 
a Fock state thus 

c j \N 1 ,N 2 ,...,N j = 0,...) = 0 

Cj \N u N 2 , ...,Nj = l,...) = (-l)( JVl+ - +JV -^|JV 1) N 2 , ...,Nj = 0,.. .} 
c}\N u N 2 , . . . , Nj = 0, . . .) = (-1)^ + -+ N ^\N U N 2 , . . . , = 1, . . .) 
4\N u N 2 ,...,N j = l,...)=0. 

These obey an anticommutation relation: 

{c i ,c j } = {) , {cj, C t} = 0 , {c i ,ct}=^-. 

One may notice from this that applying a fermionic creation operator twice gives zero, so it is impossible for the 
particles to share single-particle states, in accordance with the exclusion principle. 

Field operators 

We have previously mentioned that there can be more than one way of indexing the degrees of freedom in a quantum 
field. Second quantization indexes the field by enumerating the single-particle quantum states. However, as we have 
discussed, it is more natural to think about a "field", such as the electromagnetic field, as a set of degrees of freedom 
indexed by position. 

To this end, we can define field operators that create or destroy a particle at a particular point in space. In particle 
physics, these operators turn out to be more convenient to work with, because they make it easier to formulate 
theories that satisfy the demands of relativity. 

Single-particle states are usually enumerated in terms of their momenta (as in the particle in a box problem.) We can 
construct field operators by applying the Fourier transform to the creation and annihilation operators for these states. 
For example, the bosonic field annihilation operator 0(r)is 

3 

The bosonic field operators obey the commutation relation 

[0(r),0(r')]=O , [4>Hr),^(r')]=0 , [0(r), 0+(r')] = S 3 (r - r') 

where 8{x) stands for the Dirac delta function. As before, the fermionic relations are the same, with the 
commutators replaced by anticommutators. 

It should be emphasized that the field operator is not the same thing as a single-particle wavefunction. The former is 
an operator acting on the Fock space, and the latter is a quantum-mechanical amplitude for finding a particle in some 
position. However, they are closely related, and are indeed commonly denoted with the same symbol. If we have a 
Hamiltonian with a space representation, say 
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where the indices % and j run over all particles, then the field theory Hamiltonian (in the non-relativistic limit and 
for negligible self-interactions) is 



This looks remarkably like an expression for the expectation value of the energy, with 0 playing the role of the 
wavefunction. This relationship between the field operators and wavefunctions makes it very easy to formulate field 
theories starting from space-projected Hamiltonians. 

Implications of quantum field theory 
Unification of fields and particles 

The "second quantization" procedure that we have outlined in the previous section takes a set of single-particle 
quantum states as a starting point. Sometimes, it is impossible to define such single-particle states, and one must 
proceed directly to quantum field theory. For example, a quantum theory of the electromagnetic field must be a 
quantum field theory, because it is impossible (for various reasons) to define a wavefunction for a single photon. In 
such situations, the quantum field theory can be constructed by examining the mechanical properties of the classical 
field and guessing the corresponding quantum theory. For free (non-interacting) quantum fields, the quantum field 
theories obtained in this way have the same properties as those obtained using second quantization, such as 
well-defined creation and annihilation operators obeying commutation or anticommutation relations. 

Quantum field theory thus provides a unified framework for describing "field-like" objects (such as the 
electromagnetic field, whose excitations are photons) and "particle-like" objects (such as electrons, which are treated 
as excitations of an underlying electron field), so long as one can treat interactions as "perturbations" of free fields. 
There are still unsolved problems relating to the more general case of interacting fields that may or may not be 
adequately described by perturbation theory. For more on this topic, see Haag's theorem. 

Physical meaning of particle indistinguishability 

The second quantization procedure relies crucially on the particles being identical. We would not have been able to 
construct a quantum field theory from a distinguishable many-particle system, because there would have been no 
way of separating and indexing the degrees of freedom. 

Many physicists prefer to take the converse interpretation, which is that quantum field theory explains what identical 
particles are. In ordinary quantum mechanics, there is not much theoretical motivation for using symmetric 
(bosonic) or antisymmetric (fermionic) states, and the need for such states is simply regarded as an empirical fact. 
From the point of view of quantum field theory, particles are identical if and only if they are excitations of the same 
underlying quantum field. Thus, the question "why are all electrons identical?" arises from mistakenly regarding 
individual electrons as fundamental objects, when in fact it is only the electron field that is fundamental. 

Particle conservation and non-conservation 

During second quantization, we started with a Hamiltonian and state space describing a fixed number of particles ( 
TVX an d ended with a Hamiltonian and state space for an arbitrary number of particles. Of course, in many common 
situations TV is an important and perfectly well-defined quantity, e.g. if we are describing a gas of atoms sealed in a 
box. From the point of view of quantum field theory, such situations are described by quantum states that are 
eigenstates of the number operator ]\f , which measures the total number of particles present. As with any quantum 
mechanical observable, yyis conserved if it commutes with the Hamiltonian. In that case, the quantum state is 
trapped in the TV -particle subspace of the total Fock space, and the situation could equally well be described by 
ordinary TV -particle quantum mechanics. (Strictly speaking, this is only true in the noninteracting case or in the low 
energy density limit of renormalized quantum field theories) 



H=-^- / d\ 0t(r)V 2 0(r) + d 3 r / dV ${t)<P{t?)U{\t - r'|)0(r')0(r). 
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For example, we can see that the free-boson Hamiltonian described above conserves particle number. Whenever the 
Hamiltonian operates on a state, each particle destroyed by an annihilation operator a>k is immediately put back by 
the creation operator a\. • 

On the other hand, it is possible, and indeed common, to encounter quantum states that are not eigenstates of ]\f , 
which do not have well-defined particle numbers. Such states are difficult or impossible to handle using ordinary 
quantum mechanics, but they can be easily described in quantum field theory as quantum superpositions of states 
having different values of TV . For example, suppose we have a bosonic field whose particles can be created or 
destroyed by interactions with a fermionic field. The Hamiltonian of the combined system would be given by the 
Hamiltonians of the free boson and free fermion fields, plus a "potential energy" term such as 

H i = £ V q (a q + al q )c\ +q c k , 

k,q 

where a\. an d a k denotes the bosonic creation and annihilation operators, c£ an d c k denotes the fermionic 
creation and annihilation operators, and V^is a parameter that describes the strength of the interaction. This 
"interaction term" describes processes in which a fermion in state fa either absorbs or emits a boson, thereby being 
kicked into a different eigenstate k + q . (In fact, this type of Hamiltonian is used to describe interaction between 
conduction electrons and phonons in metals. The interaction between electrons and photons is treated in a similar 
way, but is a little more complicated because the role of spin must be taken into account.) One thing to notice here is 
that even if we start out with a fixed number of bosons, we will typically end up with a superposition of states with 

different numbers of bosons at later times. The number of fermions, however, is conserved in this case. 

In condensed matter physics, states with ill-defined particle numbers are particularly important for describing the 

various superfluids. Many of the defining characteristics of a superfluid arise from the notion that its quantum state is 

a superposition of states with different particle numbers. In addition, the concept of a coherent state (used to model 

the laser and the BCS ground state) refers to a state with an ill-defined particle number but a well-defined phase. 

Axiomatic approaches 

The preceding description of quantum field theory follows the spirit in which most physicists approach the subject. 
However, it is not mathematically rigorous. Over the past several decades, there have been many attempts to put 
quantum field theory on a firm mathematical footing by formulating a set of axioms for it. These attempts fall into 
two broad classes. 

The first class of axioms, first proposed during the 1950s, include the Wightman, Osterwalder-Schrader, and 
Haag-Kastler systems. They attempted to formalize the physicists' notion of an "operator- valued field" within the 
context of functional analysis, and enjoyed limited success. It was possible to prove that any quantum field theory 
satisfying these axioms satisfied certain general theorems, such as the spin- statistics theorem and the CPT theorem. 
Unfortunately, it proved extraordinarily difficult to show that any realistic field theory, including the Standard 
Model, satisfied these axioms. Most of the theories that could be treated with these analytic axioms were physically 
trivial, being restricted to low-dimensions and lacking interesting dynamics. The construction of theories satisfying 
one of these sets of axioms falls in the field of constructive quantum field theory. Important work was done in this 
area in the 1970s by Segal, Glimm, Jaffe and others. 

During the 1980s, a second set of axioms based on geometric ideas was proposed. This line of investigation, which 
restricts its attention to a particular class of quantum field theories known as topological quantum field theories, is 
associated most closely with Michael Atiyah and Graeme Segal, and was notably expanded upon by Edward Witten, 
Richard Borcherds, and Maxim Kontsevich. However, most of the physically relevant quantum field theories, such 
as the Standard Model, are not topological quantum field theories; the quantum field theory of the fractional 
quantum Hall effect is a notable exception. The main impact of axiomatic topological quantum field theory has been 
on mathematics, with important applications in representation theory, algebraic topology, and differential geometry. 
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Finding the proper axioms for quantum field theory is still an open and difficult problem in mathematics. One of the 
Millennium Prize Problems — proving the existence of a mass gap in Yang-Mills theory — is linked to this issue. 

Phenomena associated with quantum field theory 

In the previous part of the article, we described the most general properties of quantum field theories. Some of the 
quantum field theories studied in various fields of theoretical physics possess additional special properties, such as 
renormalizability, gauge symmetry, and super symmetry. These are described in the following sections. 

Renormalization 

Early in the history of quantum field theory, it was found that many seemingly innocuous calculations, such as the 
perturbative shift in the energy of an electron due to the presence of the electromagnetic field, give infinite results. 
The reason is that the perturbation theory for the shift in an energy involves a sum over all other energy levels, and 
there are infinitely many levels at short distances that each give a finite contribution. 

Many of these problems are related to failures in classical electrodynamics that were identified but unsolved in the 
19th century, and they basically stem from the fact that many of the supposedly "intrinsic" properties of an electron 
are tied to the electromagnetic field that it carries around with it. The energy carried by a single electron — its self 
energy — is not simply the bare value, but also includes the energy contained in its electromagnetic field, its 
attendant cloud of photons. The energy in a field of a spherical source diverges in both classical and quantum 
mechanics, but as discovered by Weisskopf, in quantum mechanics the divergence is much milder, going only as the 
logarithm of the radius of the sphere. 

The solution to the problem, presciently suggested by Stueckelberg, independently by Bethe after the crucial 
experiment by Lamb, implemented at one loop by Schwinger, and systematically extended to all loops by Feynman 
and Dyson, with converging work by Tomonaga in isolated postwar Japan, is called renormalization. The technique 
of renormalization recognizes that the problem is essentially purely mathematical, that extremely short distances are 
at fault. In order to define a theory on a continuum, first place a cutoff on the fields, by postulating that quanta 
cannot have energies above some extremely high value. This has the effect of replacing continuous space by a 
structure where very short wavelengths do not exist, as on a lattice. Lattices break rotational symmetry, and one of 
the crucial contributions made by Feynman, Pauli and Villars, and modernized by 't Hooft and Veltman, is a 
symmetry preserving cutoff for perturbation theory. There is no known symmetrical cutoff outside of perturbation 
theory, so for rigorous or numerical work people often use an actual lattice. 

On a lattice, every quantity is finite but depends on the spacing. When taking the limit of zero spacing, we make sure 
that the physically observable quantities like the observed electron mass stay fixed, which means that the constants 
in the Lagrangian defining the theory depend on the spacing. Hopefully, by allowing the constants to vary with the 
lattice spacing, all the results at long distances become insensitive to the lattice, defining a continuum limit. 

The renormalization procedure only works for a certain class of quantum field theories, called renormalizable 
quantum field theories. A theory is perturbatively renormalizable when the constants in the Lagrangian only 
diverge at worst as logarithms of the lattice spacing for very short spacings. The continuum limit is then well defined 
in perturbation theory, and even if it is not fully well defined non-perturbatively, the problems only show up at 
distance scales that are exponentially small in the inverse coupling for weak couplings. The Standard Model of 
particle physics is perturbatively renormalizable, and so are its component theories (quantum 
electrodynamics/electro weak theory and quantum chromodynamics). Of the three components, quantum 
electrodynamics is believed to not have a continuum limit, while the asymptotically free SU(2) and SU(3) weak 
hypercharge and strong color interactions are nonperturbatively well defined. 

The renormalization group describes how renormalizable theories emerge as the long distance low-energy effective 
field theory for any given high-energy theory. Because of this, renormalizable theories are insensitive to the precise 
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nature of the underlying high-energy short-distance phenomena. This is a blessing because it allows physicists to 
formulate low energy theories without knowing the details of high energy phenomenon. It is also a curse, because 
once a renormalizable theory like the standard model is found to work, it gives very few clues to higher energy 
processes. The only way high energy processes can be seen in the standard model is when they allow otherwise 
forbidden events, or if they predict quantitative relations between the coupling constants. 

Gauge freedom 

A gauge theory is a theory that admits a symmetry with a local parameter. For example, in every quantum theory the 
global phase of the wave function is arbitrary and does not represent something physical. Consequently, the theory is 
invariant under a global change of phases (adding a constant to the phase of all wave functions, everywhere); this is a 
global symmetry. In quantum electrodynamics, the theory is also invariant under a local change of phase, that is - 
one may shift the phase of all wave functions so that the shift may be different at every point in space-time. This is a 
local symmetry. However, in order for a well-defined derivative operator to exist, one must introduce a new field, 
the gauge field, which also transforms in order for the local change of variables (the phase in our example) not to 
affect the derivative. In quantum electrodynamics this gauge field is the electromagnetic field. The change of local 
gauge of variables is termed gauge transformation. 

In quantum field theory the excitations of fields represent particles. The particle associated with excitations of the 
gauge field is the gauge boson, which is the photon in the case of quantum electrodynamics. 

The degrees of freedom in quantum field theory are local fluctuations of the fields. The existence of a gauge 
symmetry reduces the number of degrees of freedom, simply because some fluctuations of the fields can be 
transformed to zero by gauge transformations, so they are equivalent to having no fluctuations at all, and they 
therefore have no physical meaning. Such fluctuations are usually called "non-physical degrees of freedom" or gauge 
artifacts; usually some of them have a negative norm, making them inadequate for a consistent theory. Therefore, if 
a classical field theory has a gauge symmetry, then its quantized version (i.e. the corresponding quantum field 
theory) will have this symmetry as well. In other words, a gauge symmetry cannot have a quantum anomaly. If a 
gauge symmetry is anomalous (i.e. not kept in the quantum theory) then the theory is non-consistent: for example, in 
quantum electrodynamics, had there been a gauge anomaly, this would require the appearance of photons with 
longitudinal polarization and polarization in the time direction, the latter having a negative norm, rendering the 
theory inconsistent; another possibility would be for these photons to appear only in intermediate processes but not 
in the final products of any interaction, making the theory non unitary and again inconsistent (see optical theorem). 

In general, the gauge transformations of a theory consist of several different transformations, which may not be 
commutative. These transformations are together described by a mathematical object known as a gauge group. 
Infinitesimal gauge transformations are the gauge group generators. Therefore the number of gauge bosons is the 
group dimension (i.e. number of generators forming a basis). 

All the fundamental interactions in nature are described by gauge theories. These are: 

• Quantum electrodynamics, whose gauge transformation is a local change of phase, so that the gauge group is 
U(l). The gauge boson is the photon. 

• Quantum chromodynamics, whose gauge group is SU(3). The gauge bosons are eight gluons. 

• The electroweak Theory, whose gauge group is U(l) ® SU (2) (a direct product of U(l) and SU(2)). 

• Gravity, whose classical theory is general relativity, admits the equivalence principle, which is a form of gauge 
symmetry. However, it is explicitly non-renormalizable. 
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Multivalued Gauge Transformations 

The gauge transformations which leave the theory invariant involve by definition only single- valued gauge functions 



functions which violate the integrability criterion. These are capable of changing the physical field strengths and are 
therefore no proper symmetry transformations. Nevertheless, the transformed field equations describe correctly the 
physical laws in the presence of the newly generated field strengths. See the textbook by H. Kleinert cited below for 
the applications to phenomena in physics. 

Supersymmetry 

Supersymmetry assumes that every fundamental fermion has a superpartner that is a boson and vice versa. It was 
introduced in order to solve the so-called Hierarchy Problem, that is, to explain why particles not protected by any 
symmetry (like the Higgs boson) do not receive radiative corrections to its mass driving it to the larger scales (GUT, 
Planck...). It was soon realized that supersymmetry has other interesting properties: its gauged version is an 
extension of general relativity (Supergravity), and it is a key ingredient for the consistency of string theory. 

The way supersymmetry protects the hierarchies is the following: since for every particle there is a superpartner with 
the same mass, any loop in a radiative correction is cancelled by the loop corresponding to its superpartner, 
rendering the theory UV finite. 

Since no superpartners have yet been observed, if supersymmetry exists it must be broken (through a so-called soft 
term, which breaks supersymmetry without ruining its helpful features). The simplest models of this breaking require 
that the energy of the superpartners not be too high; in these cases, supersymmetry is expected to be observed by 
experiments at the Large Hadron Collider. 




See also 



• List of quantum field theories 

• Constructive quantum field theory 

• Feynman path integral 

• Quantum chromodynamics 

• Quantum electrodynamics 

• Quantum flavordynamics 

• Quantum geometrodynamics 

• Quantum hydrodynamics 

• Quantum magnetodynamics 

• Quantum triviality 

• Schwinger-Dyson equation 

• Einstein-Maxwell-Dirac equations 

• Relation between Schrodinger's equation and the path integral formulation of 



Abraham-Lorentz force 



Green's function (many-body theory) 

Common integrals in quantum field theory 

Wheeler-Feynman absorber theory 

Wigner's theorem 

Wigner's classification 

Static forces and virtual-particle exchange 



Photon polarization 

Theoretical and experimental justification for the 



Schrodinger equation 
Invariance mechanics 



Form factor 



Green-Kubo relations 



quantum mechanics 

• Basic concepts of quantum mechanics 

• Relationship between string theory and quantum field theory 



Quantum field theory 



210 



Notes 

[1] Weinberg, S. Quantum Field Theory, Vols. I to III, 2000, Cambridge University Press: Cambridge, UK. 
[2] (http :// physics . princeton.edu/ -mcdonald/ examples/ QM/ thorn_aj p_72_ 121 0_04 . pdf) 
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Scalar field theory 

In theoretical physics, scalar field theory can refer to a classical or quantum theory of scalar fields. A field which is 
invariant under any Lorentz transformation is called a "scalar", in contrast to a vector or tensor field. The quanta of 
the quantized scalar field are spin-zero particles, and as such are bosons. 

No fundamental scalar fields have been observed in nature, though the Higgs boson may yet prove the first example. 
However, scalar fields appear in the effective field theory descriptions of many physical phenomena. An example is 
the pion, which is actually a "pseudoscalar", which means it is not invariant under parity transformations which 
invert the spatial directions, distinguishing it from a true scalar, which is parity-invariant. Because of the relative 
simplicity of the mathematics involved, scalar fields are often the first field introduced to a student of classical or 
quantum field theory. 

In this article, the repeated index notation indicates the Einstein summation convention for summation over repeated 
indices. The theories described are defined in flat, D-dimensional Minkowski space, with (D-l) spatial dimension 
and one time dimension and are, by construction, relativistically covariant. The Minkowski space metric, , has a 
particularly simple form: it is diagonal, and here we use the + sign convention. 
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Classical scalar field theory 
Linear (free) theory 

The most basic scalar field theory is the linear theory. The action for the free relativistic scalar field theory is 

1 _ . 1 



S = J d^xdtC = J d D ~ l xdt 



/ 



= / d^xdf 



where £ is known as a Lagrangian density. This is an example of a quadratic action, since each of the terms is 
quadratic in the field, (j) . The term proportional to m 2 is sometimes known as a mass term, due to its interpretation 
in the quantized version of this theory in terms of particle mass. 

The equation of motion for this theory is obtained by extremizing the action above. It takes the following form, 
linear in 0 : 

rTd^ + ™*0 = dt<t> ~ V 2 0 + m 2 4> = 0 
Note that this is the same as the Klein-Gordon equation, but that here the interpretation is as a classical field 
equation, rather than as a quantum mechanical wave equation. 



Nonlinear (interacting) theory 

The most common generalization of the linear theory above is to add a scalar potential V^(0)to the equations of 
motion, where typically, V is a polynomial in cp of order 3 or more (often a monomial). Such a theory is sometimes 
said to be interacting, because the Euler-Lagrange equation is now is nonlinear, implying a self-interaction. The 
action for the most general such theory is 



S = J d D ~ 1 xdt£ = J d D ~ l xdt 



/I 1 1 * 1 

d^sdf -(drf) 2 - - -m 2 cf> 2 - J2 ~ y 9^ 

71=3 



The n\ factors in the expansion are introduced because they are useful in the Feynman diagram expansion of the 
quantum theory, as described below. The corresponding Euler-Lagrange equation of motion is 

rTd^ + V\<t)) = d\<$> - V 2 0 + V'{tf>) = 0. 
Dimensional analysis and scaling 

Physical quantities in these scalar field theories may have dimensions of length, time or mass, or some combination 
of the three. However, in a relativistic theory, any quantity t, with dimensions of time, can be 'converted' into a 
length, I = ct i by using the velocity of light, c. 

h 

Similarly, any length / is equivalent to an inverse mass, [ = , using Planck's constant, ft . Heuristically, one 

mc 

can think of a time as a length, or either time or length as an inverse mass. In short, one can think of the dimensions 
of any physical quantity as defined in terms of just one independent dimension, rather than in terms of all three. This 
is most often termed the mass dimension of the quantity. 

One objection is that this theory is classical, and therefore it is not obvious that Planck's constant should be a part of 
the theory at all. In a sense this is a valid objection, and if desired one can indeed recast the theory without mass 
dimensions at all. However, this would be at the expense of making the connection with the quantum scalar field 
slightly more obscure. Given that one has dimensions of mass, Planck's constant is thought of here as an essentially 
arbitrary fixed quantity with dimensions appropriate to convert between mass and inverse length. This is consistent 
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with the Feynman path integral approach to quantization, where the only reason for Planck's constant to appear stems 
from the same type of dimensional argument, since the action must be divided by some parameter with these 
dimensions to render the phase dimensionless. 

Scaling Dimension 

The classical scaling dimension, or mass dimension, A , of 0 describes the transformation of the field under a 
rescaling of coordinates: 

x — > Xx 

<j> \~ A <p 

The units of action are the same as the units of fa , and so the action itself has zero mass dimension. This fixes the 
scaling dimension of 0 to be 

Scale invariance 

There is a specific sense in which some scalar field theories are scale-invariant. While the actions above are all 
constructed to have zero mass dimension, not all actions are invariant under the scaling transformation 

x — > Xx 

The reason that not all actions are invariant is that one usually thinks of the parameters m and g n as fixed quantities, 
which are not rescaled under the transformation above. The condition for a scalar field theory to be scale invariant is 
then quite obvious: all of the parameters appearing in the action should be dimensionless quantities. In other words, a 
scale invariant theory is one without any fixed length scale (or equivalently, mass scale) in the theory. 

2D 



For a scalar field theory with D spacetime dimensions, the only dimensionless parameter g n satisfies n = ^_ ^ 

. For example, in D=4 only 54 is classically dimensionless, and so the only classically scale-invariant scalar field 
theory in £) = 4is the massless 0 4 theory. Classical scale invariance normally does not imply quantum scale 

invariance. See the discussion of the beta function below. 

Conformal invariance 

A transformation 

x —> x{x) 

is said to be conformal if the transformation satisfies 

for some function A 2 [x) • The conformal group contains as subgroups the isometries of the metric (the 

Poincare group) and also the scaling transformations (or dilatations) considered above. In fact, the scale-invariant 
theories in the previous section are also conformally-invariant. 
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<p 4 theory 

Massive 0 4 theory illustrates a number of interesting phenomena in scalar field theory. 
The Lagrangian density is 

Spontaneous symmetry breaking 

This Lagrangian has a Z2 symmetry under the transformation (j) — > —(f) 

This is an example of an internal symmetry, in contrast to a space-time symmetry. 

If TT^is positive, the potential V((f)) = — m 2 0 2 + — 0 4 has a single minimum, at the origin. The solution 

2 4! 

(j) = Ois clearly invariant under the Z2 symmetry. Conversely, if m 2 is negative, then one can readily see that the 
potential V((b) = —m 2 (t> 2 + — (£ 4 has two minima. This is known as a double well potential, and the lowest 

2 4! 

energy states (known as the vacua, in quantum field theoretical language) in such a theory are not invariant under the 
Z2 symmetry of the action (in fact it maps each of the two vacua into the other). In this case, the Z2 symmetry is 
said to be spontaneously broken. 

Kink solutions 

The 0 4 theory with a negative m 2 also has a kink solution, which is a canonical example of a soliton. Such a 
solution is of the form 

f / . x m , (mix — Xq)\ 

where x is one of the spatial variables ( 0 is taken to be independent of t, and the remaining spatial variables). The 
solution interpolates between the two different vacua of the double well potential. It is not possible to deform the 
kink into a constant solution without passing through a solution of infinite energy, and for this reason the kink is said 
to be stable. For ]J > 2 , i.e. theories with more than one spatial dimension, this solution is called a domain wall. 
Another well-known example of a scalar field theory with kink solutions is the sine-Gordon theory. 

Complex scalar field theory 

In a complex scalar field theory, the scalar field takes values in the complex numbers, rather than the real numbers. 
The action considered normally takes the form 



S = J dP^xdtC = J d^xdt [rrWdvt ~ ^(I0| 2 )] 



This has a U(l) symmetry, whose action on the space of fields rotates 0 — > e %OL (j) , for some real phase angle ot . 

2 

As for the real scalar field, spontaneous symmetry breaking is found if m is negative. This gives rise to a Mexican 
hat potential which is analogous to the double-well potential in real scalar field theory, but now the choice of 
vacuum breaks a continuous U(l) symmetry instead of a discrete one. This leads to a Goldstone boson. 

0(N) theory 

One can express the complex scalar field theory in terms of two real fields, (j) 1 = Re(j) and (j) 2 = Im(j) which 
transform in the vector representation of the U(l) = O (2) internal symmetry. Although such fields transform as a 
vector under the internal symmetry, they are still Lorentz scalars. This can be generalised to a theory of N scalar 
fields transforming in the vector representation of the 0(N) symmetry. The Lagrangian for an O (N) -invariant scalar 
field theory is typically of the form 
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£ = \rTW ' 14 ~ H<t> ' <l>) 
using an appropriate O(N) -invariant inner product. 

Quantum scalar field theory 

In quantum field theory, the fields, and all observables constructed from them, are replaced by quantum operators on 
a Hilbert space. This Hilbert space is built on a vacuum state, and dynamics are governed by a Hamiltonian, a 
positive operator which annihilates the vacuum. A construction of a quantum scalar field theory may be found in the 
canonical quantization article, which uses canonical commutation relations among the fields as a basis for the 
construction. In brief, the basic variables are the field cp and its canonical momentum jt. Both fields are Hermitian . 
At spatial points x, y at equal times, the canonical commutation relations are given by 

[(j>{x),(f>{y)] = [7r(x),7r(y)] = 0, [0(f), 7r(y)] = i6(x-y), 
and the free Hamiltonian is 



H = Jd 3 x 

A spatial Fourier transform leads to a momentum space fields 

0(jfc) = J d 3 xe~ its (l)(x), 7r(fc) = J tPie^' 3 ^ 
which are used to define annihilation and creation operators 

o(jfe) = (E<j>(k) + iir(jk)) , a + (jfc) = (E$(k) - m{k)) , 
where — \/jt 2 -|- m? • These operators satisfy the commutation relations 

[a(fci), a(k 2 )] = [a + (fci), a\k 2 )] = 0, [aft), <J(k 2 )} = (2 7 r) 3 2^(fc 1 - fc 2 ). 

The state I0> annihilated by all of the operators a is identified as the bare vacuum, and a particle with momentum 
is created by applying a^(k)^° me vacuum. Applying all possible combinations of creation operators to the vacuum 
constructs the Hilbert space. This construction is called Fock space. The vacuum is annihilated by the Hamiltonian 

where the zero-point energy has been removed by Wick-ordering. (See canonical quantization.) 

Interactions can be included by adding an interaction Hamiltonian. For a cp 4 theory, this corresponds to adding a 
Wick-ordered term g\cp A \IA\ to the Hamiltonian, and integrating over x. Scattering amplitudes may be calculated from 
this Hamiltonian in the interaction picture. These are constructed in perturbation theory by means of the Dyson 
series, which gives the time-ordered products, or ^-particle Green's functions (0|T \(j){xi) • • • 0(x n )}|O) as 
described in the Dyson series article. The Green's functions may also be obtained from a generating function that is 
constructed as a solution to the Schwinger-Dyson equation. 
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Feynman Path Integral 

The Feynman diagram expansion may be obtained also from the Feynman path integral formulation.^ The time 
ordered vacuum expectation values of polynomials in cp, known as the ^-particle Green's functions, are constructed 
by integrating over all possible fields, normalized by the vacuum expectation value with no external fields, 

(O|T{0(x a ) • • • <f>{x n )}\0) = J V<Wi) W*nJe 

All of these Green's functions may be obtained by expanding the exponential in J(x)cp(x) in the generating function 

Z\J\ = [ v ^^*^^-^-l^+ J *) = z r 0 i y ho\T{<f>( Xl ) • ■ -0(x„)}|O). 

A Wick rotation may be applied to make time imaginary. Changing the signature to (++++) then turns the Feynman 
integral into a statistical mechanics partition function in Euclidean space, 



Normally, this is applied to the scattering of particles with fixed momenta, in which case, a Fourier transform is 
useful, giving instead 



z[j] = J v^-sM^+^+t^-^). 



The standard trick to evaluate this functional integral is to write it as a product of exponential factors, schematically, 

-(^+mV/2 e -^ 4 /4! e -^' 



Z[J\~ J ' V4>H[e-b 2+ 



P 

The second two exponential factors can be expanded as power series, and the combinatorics of this expansion can be 
represented graphically. The integral with X = 0 can be treated as a product of infinitely many elementary Gaussian 
integrals, and the result may be expressed as a sum of Feynman diagrams, calculated using the following Feynman 
rules: 

• Each field in the n-point Euclidean Green's function is represented by an external line (half-edge) in the 

graph, and associated with momentum p. 

• Each vertex is represented by a factor -g. 

k 

• At a given order g , all diagrams with n external lines and k vertices are constructed such that the momenta 

2 2 

flowing into each vertex is zero. Each internal line is represented by a propagator l/(q + m ), where q is the 
momentum flowing through that line. 

• Any unconstrained momenta are integrated over all values. 

• The result is divided by a symmetry factor, which is the number of ways the lines and vertices of the graph can be 
rearranged without changing its connectivity. 

• Do not include graphs containing "vacuum bubbles", connected subgraphs with no external lines. 

The last rule takes into account the effect of dividing by Z[0] • The Minkowski-space Feynman rules are similar, 

2 2 

except that each vertex is represented by -ig, while each internal line is represented by a propagator i/(q -m + ie), 
where the 'e term represents the small Wick rotation needed to make the Minkowski-space Gaussian integral 
converge. 
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Renormalization 

The integrals over unconstrained momenta, called "loop integrals", in the Feynman graphs typically diverge. This is 

normally handled by renormalization, which is a procedure of adding divergent counter-terms to the Lagrangian in 

T21 

such a way that the diagrams constructed from the original Lagrangian and counter-terms is finite. A 
renormalization scale must be introduced in the process, and the coupling constant and mass become dependent upon 
it. 

The dependence of a coupling constant g on the scale X is encoded by a beta function, (3(g), defined by the relation 

This dependence on the energy scale is known as the running of the coupling parameter, and theory of this kind of 
scale-dependence in quantum field theory is described by the renormalization group. 

Beta-functions are usually computed in an approximation scheme, most commonly perturbation theory, where one 
assumes that the coupling constant is small. One can then make an expansion in powers of the coupling parameters 
and truncate the higher-order terms (also known as higher loop contributions, due to the number of loops in the 
corresponding Feynman graphs). 

The beta- function at one loop (the first perturbative contribution) for the 0 4 theory is 

The fact that the sign in front of the lowest-order term is positive suggests that the coupling constant increases with 
energy. If this behavior persists at large couplings, this would indicate the presence of a Landau pole at finite energy, 
or quantum triviality. The question can only be answered non-perturbatively, since it involves strong coupling. 

A quantum field theory is trivial when the running coupling, computed through its beta function, goes to zero when 
the cutoff is removed. Consequently, the propagator becomes that of a free particle and the field is no longer 
interacting. Alternatively, the field theory may be interpreted as an effective theory, in which the cutoff is not 
removed, giving finite interactions but leading to a Landau pole at some energy scale. For a cp 4 interaction, Michael 
Aizenman proved that the theory is indeed trivial for space-time dimension D > 5.^ For f) = 4 the triviality 
has yet to be proven rigorously, but lattice computations have confirmed this. (See Landau pole for details and 
references.) This fact is relevant as the Higgs field, for which triviality bounds are used to set limits on the Higgs 
mass, based on the new physics must enter at a higher scale (perhaps the Planck scale) to prevent the Landau pole 
from being reached. 
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Further reading 

• Peskin, M and Schroeder, D. \An Introduction to Quantum Field Theory, Westview Press (1995) 

• Weinberg, Steven ; The Quantum Theory of Fields, (3 volumes) Cambridge University Press (1995) 

• Srednicki, Mark; Quantum Field Theory, Cambridge University Press (2007) 

• Zinn- Justin, Jean ; Quantum Field Theory and Critical Phenomena, Oxford University Press (2002) 

External links 

• Pedagogic Aides to Quantum Field Theory (http://www.quantumfieldtheory.info) Click on the link for Chap. 3 
to find an extensive, simplified introduction to scalars in relativistic quantum mechanics and quantum field 
theory. 

• 't Hooft, G., "The Conceptual Basis of Quantum Field Theory" ( online version (http://www.phys.uu.nl/ 
-thooft/lectures/basisqft.pdf)). 

Yang-Mills theory 

Yang-Mills theory is a gauge theory based on the SU(N) group. Wolfgang Pauli formulated in 1953 the first 
consistent generalization of the five-dimensional theory of Kaluza, Klein, Fock and others to a higher dimensional 
internal space. ^ Because Pauli saw no way to give masses to the gauge bosons, he refrained from publishing his 
results formally J 1 ^ 

Although Pauli did not publish this theory, he gave talks widely attended by physicists of the time. In early 1954, 
Yang and Mills developed a modern formulation in an effort to extend the original concept of gauge theory for 
abelian groups, e.g. quantum electrodynamics, to nonabelian groups to provide an explanation for strong 
interactions. This initial idea was not a success, since the quanta of the Yang-Mills field must be massless in order to 
maintain gauge invariance. The massless particles should have long range effects, but these effects are not seen in 
experiments. The idea was set aside until 1960, when the concept of particles acquiring mass through symmetry 
breaking in massless theories was put forward, initially by Jeffrey Goldstone, Yoichiro Nambu, and Giovanni 
Jona-Lasinio. 

This prompted a significant restart of Yang-Mills theory studies that proved successful in the formulation of both 
electro weak unification and quantum chromodynamics (QCD). The electro weak interaction is described by 
SU(2)xU(l) group while QCD is an SU(3) gauge theory. The electro weak theory is obtained by combining SU(2) 
with U(l), where quantum electrodynamics (QED) is described by a U(l) group, and is replaced in the unified 
electro weak theory by a U(l) group representing a weak hypercharge rather than electric charge. The massless 
bosons from the SU(2)xU(l) theory mix after spontaneous symmetry breaking to produce the 3 massive weak 
bosons, and the photon field. The Standard Model combines the strong interaction, with the unified electroweak 
interaction (unifying the weak and electromagnetic interaction) through the symmetry group SU(2)xU(l)xSU(3). In 
the current epoch the strong interaction is not unified with the electroweak interaction, but from the observed 
running of the coupling constants it is believed they all converge to a single value at very high energies. 

Phenomenology at lower energies in quantum chromodynamics is not completely understood due to the difficulties 
of managing such a theory with a strong coupling. This is the reason confinement has not been theoretically proven, 
though it is a consistent experimental observation. Proof that QCD confines at low energy is a mathematical problem 
of great relevance, and an award has been proposed by the Clay Mathematics Institute for whoever is able to show 
that the Yang-Mills theory has a mass gap. 
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Mathematical overview 

Yang-Mills theories are a special example of gauge theory with symmetry non-abelian group given by the 
Lagrangian 

£, g , = -h T r(F 2 ) = -^ a F^ 

with the generators of the Lie algebra corresponding to the F-quantities (the curvature or field- strength form) 
satisfying 

[T a ,T b ] =if abc T c 

and the covariant derivative defined as 

Dp = Idp - i 9 T a Al 

where J is the identity for the group generators, A a ^ is the vector potential, and Q is the coupling constant. In four 

dimensions, the coupling constant Q is a pure number and for a SU(N) group one has a, 6, c = 1 . . . TV 2 — 1. 
The relation 

F; v = dpA* v - d u A; + gf^A^Al 

can be derived by the commutator 

The field has the property of being self-interacting and equations of motion that one obtains are said to be semilinear, 
as nonlinearities are both with and without derivatives. This means that one can manage this theory only by 
perturbation theory, with small nonlinearities. 

Note that the transition between "upper" ("contravariant") and "lower" ("covariant") vector or tensor components is 
trivial for a indices (e.g. f abc = f abc ), whereas for \i and v it is nontrivial, corresponding e.g. to the usual Lorentz 

signature, rj^ = diag (H ) . 

From the given Lagrangian one can derive the equations of motion given by 

d^F^ + gf abc A" b F^ = ^ 

Putting Ffu, = T a F^^ these can be rewritten as 

{D^F^f = 0. 

A Bianchi identity holds 

{D^Y + {DnF^Y + (D v F^) a = 0. 

A source enters into the equations of motion as 

d»Fz v + g r bc A» b F; v = -r v . 

Note that the currents must properly change under gauge group transformations. 



Quantization of Yang-Mills theory 

The most appropriate method to quantize the Yang-Mills theory is by functional methods, i.e. path integrals. One 
introduces a generating functional for n-point functions as 

Z ^ = J |^4je _ 4 / ^ 4a; Tr( J F ,//I/ i^ Miy )-|-i / d 4 x j^(x)A afJ '(x) 

but this integral has no meaning as is because the potential vector can be arbitrarily chosen due to the gauge freedom. 
This problem was already known for quantum electrodynamics but here becomes more severe due to non-abelian 
properties of the gauge group. A way out has been given by Ludvig Faddeev and Victor Popov with the introduction 
of a ghost field (see Faddeev-Popov ghost) that has the property of being unphysical since, although it agrees with 
Fermi-Dirac statistics, it is a complex scalar field, violating in this way the spin-statistics theorem. So, we can write 
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the generating functional as 



ZfcM] = j [dA][dc][dc]eH 

e i f d 4 xj£ { X )A<W (x)+i f d*x[c» (x)s a (x)c a {x)] 

that is the expression commonly used to derive Feynman's rules (see Feynman diagram). Here we have c a f° r me 
ghost field while a fixes the gauge's choice for the quantization. Feynman's rules obtained from this functional are 
the following 



, P 

01/ 'WAAAA/ a l J 

gluon propagator: D?*(p) = 



-iS ab 
p 2 + i0 




(q - r)tf*x + (r - p)*ifcx] 



4-gluon vertex: l^L = H^/*/*0wifc* -ifcrffc a) 
ghost propagator: C° (p) - 



p 2 + i0 



c .-^ a 



CCS - vertex: F^p) = gf abc Pfl 
These rules for Feynman diagrams are easily obtained when we realize that the generating functional given above 
can be rewritten as 

-ig fd 4 x * , f abc du— r— r#r -*9 f d 4 xf abc duT^-^— r~ rln 
Z[7, e, e] = e i*e°(aO-' ^Ju*) 5e ( E ) e J '■WW *3 CV W x 

2 

^ t 4 j dTxf f sjb ^ x) Sj c (x) Sj r^ x) Sjav{x) _^ ^ 



e 

being 

Z 0 \j,E,e] = E~ J ^^y^{x)C-\x-y)e\y) e \ j ^xd 4 yj-(x)D^(x- y )jUy) 

the generating functional of the free theory. Expanding in 9 and computing the functional derivatives, we are able to 
obtain all the n-point functions with perturbation theory. Using LSZ reduction formula we get from the n-point 
functions the corresponding amplitudes for the given processes and cross sections and decay rates are promptly 
obtained. The theory is renormalizable and corrections are finite at any order of perturbation theory. 

For quantum electrodynamics, being in this case abelian the gauge group (see abelian group), the ghost field 
decouples. This can be easily realized when we look at the coupling between the gauge field and the ghost field that 
is c a / a6c fl l pA bM C c . For the Abelian case all the structure constants f abc are zero and so there is no coupling. In 

the non- Abelian case, the ghost field appears as a useful way to rewrite the quantum field theory without physical 
consequences on the observables of the theory as cross sections or decay rates. 
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One of the most important results obtained for Yang-Mills theory is asymptotic freedom. This result can be obtained 
assuming the coupling constant 9 as small (so small nonlinearities), as indeed happens to high energies, and 
applying perturbation theory. The relevance of this result is due to the fact that a Yang-Mills theory describes strong 
interactions and asymptotic freedom permits to treat properly experimental results coming from deep inelastic 
scattering. 

In order to obtain the behavior at high energies of the Yang-Mills theory, and so to prove asymptotic freedom, one 
does perturbation theory assuming a small coupling. This is verified a posteriori in the ultraviolet limit. In the 
opposite limit, infrared limit, the situation is quite the opposite being the coupling too large for perturbation theory to 
be reliable. Indeed, most of the difficulties that current research meets is just managing the theory at low energies 
that is the interesting one being inherent to the description of hadronic matter and, more generally, to all the observed 
bound states of gluons and quarks and their confinement (see hadrons). Then, the most used method to study the 
theory in this limit is to try to solve it on computers (see lattice gauge theory). In this case, large computational 
resources are needed to be sure the right limit of infinite volume (smaller lattice spacing) is hit. This is the limit the 
results have to be compared with. Smaller spacing and larger coupling are not independent each others and to 
accomplish both larger computational resources are demanded. As for today, the situation appears somewhat 
satisfactory for the hadronic spectrum and the computation of the gluon and ghost propagators but the glueball and 
hybrids spectra are yet a questioned matter also in view of the experimental observation of such exotic states. Indeed, 
the a resonance^ ^ is not seen in any of such lattice computations and contrasting interpretations have been put 
forward. This is currently a hotly debated issue. 



Beta function 

One of the key properties of a quantum field theory is the behavior at all the energy range of the running coupling. 
Such a behavior can be obtained from a theory once its beta function is known. Our ability of extracting results from 
a quantum field theory relies on perturbation theory. Once the beta function is known, the behavior at all energy 
scales of the running coupling is obtained through the equation 

being a s = <? 2 /47r . Yang-Mills theory has the property of being asymptotically free in the large energy limit 

(ultraviolet limit). This means that, in this limit, beta function has a minus sign driving the behavior of the running 
coupling toward even smaller values as the energy increases. Perturbation theory permits to evaluate beta function in 
this limit producing the following result for SU(N) 

nl \ HJV 2 UN 2 3 , 4 , 

In the opposite limit of low energies (infrared limit), beta function is not known. It is note the exact one for a 
supersymmetric Yang— Mills theory. This has been obtained by Novikov, Shifman, Vainshtein and Zakharov^ and 
can be written as 

PM = - 



With this starting point, Thomas Ryttov and Francesco Sannino were able to postulate a non-supersymmetric version 
UN 1 



i"7] 

of it writing down 



127T 1 ±L£Li 

As can be seen from the beta function of the supersymmetric theory, the limit of a large coupling (infrared limit) 
implies 
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and so the running coupling in the deep infrared limit goes to zero making this theory trivial. This implies that the 
coupling reaches a maximum at some value of the energy turning again to zero as the energy is lowered. Then, if 
Ryttov and Sannino hypothesis is correct, the same should be true for ordinary Yang-Mills theory. This would be in 
agreement with recent lattice computations . 

Open problems 

The Yang-Mills theories were generally acknowledged in the physics community after Gerard 't Hooft, in 1972, 
could prove their renormalizability. This applies even if the gauge bosons described by this theory are massive, as in 
the electroweak theory. However, the mass is only an "acquired" one, namely, as suggested, by the famous Higgs 
mechanism. 

Concerning the mathematics, it should be noted that presently, i.e. in 2009, the Yang-Mills theory is a very active 
field of research, yielding e.g. a classification of differentiable structures of four-dimensional manifolds by Simon 
Donaldson. Furthermore, the field of Yang-Mills theories was included in the Clay Mathematics Institute's list of 
"Millennium Prize Problems". Here the prize-problem consists, especially, in a proof of the conjecture that the 
lowest excitations of a pure Yang-Mills theory (i.e. without matter fields) have a finite mass-gap with regard to the 
vacuum state. Another open problem, connected with this conjecture, is a proof of the confinement property in the 
presence of additional Fermion particles. 

In physics the survey of Yang-Mills theories does not usually start from perturbation analysis or analytical methods, 
but more recently from systematic application of numerical methods to lattice gauge theories. 

References 

[1] Straumann, N: "On Pauli's invention of non-abelian Kaluza-Klein Theory in 1953" eprint arXiv.gr=qc/00 12054 

[2] See Abraham Pais' account of this period as well as L. Susskind's "Superstrings, Physics World on the first non-abelian gauge theory" where 

Susskind wrote that Yang-Mills was "rediscovered" only because Pauli had chosen not to publish 
[3] Yang, C. N.; Mills, R. (1954), "Conservation of Isotopic Spin and Isotopic Gauge Invariance", Physical Review 96 (1): 191-195, 

doi: 1 0. 1 1 03/Phy sRev.96. 1 9 1 

[4] Caprini, L; Colangelo, G.; Leutwyler, H. (2006), "Mass and width of the lowest resonance in QCD", Physical Review Letters (13): 132001, 
doi: 10. 1 1 03/PhysRevLett.96. 1 32001 

[5] Yndurain, F. J.; Garcia-Martin, R.; Pelaez, J. R. (2007), "Experimental status of the 7T7T isoscalar S wave at low energy: Jo (600) pole 

and scattering length", Physical Review D 76 (7): 074034, doi:10.1103/PhysRevD.76.074034 
[6] Novikov, V. A.; Shifman, M. A.; A. I. Vainshtein, A. L; Zakharov, V. I. (1983), "Exact Gell-Mann-Low Function Of Supersymmetric 

Yang-Mills Theories From Instanton Calculus", Nuclear Physics B 229 (2): 381-393, doi:10.1016/0550-3213(83)90338-3 

[7] Ryttov, T.; Sannino, F. (2008), "Supersymmetry Inspired QCD Beta Function", Physical Review D 78 (6): 065001, 

doi:10.1103/PhysRevD.78.065001 

[8] Bogolubsky, I. L.; Ilgenfritz, E.-M.; A. I. Muller-Preussker, M.; Sternbeck, A. (2009), "Lattice gluodynamics computation of Landau-gauge 
Green's functions in the deep infrared", Physics Letters B 676 (1-3): 69-73, doi:10.1016/j.physletb.2009.04.076 



Yang-Mills theory 



223 



Further reading 

Books 

• Frampton, P. (2008). Gauge Field Theories (3rd ed.). Wiley-VCH. ISBN 978-3527408351. 

• Cheng, T.-P.; Li, L.-F. (1983). Gauge Theory of Elementary Particle Physics. Oxford University Press. 
ISBN 0-19-851961-3. 

• 't Hooft, Gerardus (2005). 50 Years of Yang-Mills theory. World Scientific. ISBN 981-238-934-2. 
Articles 

• Svetlichny, George (1999). " Preparation for Gauge Theory (http://arxiv.org/abs/math-ph/9902027)". 

• Gross, D. (1992). "Gauge theory - Past, Present and Future" (http://psroc.phys.ntu.edu.tw/cjp/v30/955.pdf). 
Retrieved 2009-04-23. 

External links 

• Yang-Mills theory on Dispersive Wiki (http://tosio.math.toronto.edu/wiki/index.php/Yang-Mills_equations) 

• The Clay Mathematics Institute (http://www.claymath.org) 

• The Millennium Prize Problems (http://www.claymath.org/prizeproblems) 



Yangian 

Yangian is an important structure in modern representation theory, a type of a quantum group with origins in 
physics. Yangians first appeared in the work of Ludvig Faddeev and his school concerning the quantum inverse 
scattering method in the late 1970s and early 1980s. Initially they were considered a convenient tool to generate the 
solutions of the quantum Yang-Baxter equation. The name Yangian was introduced by Vladimir Drinfeld in 1985 in 
honor of C.N. Yang. 



Description 

For any finite-dimensional semisimple Lie algebra a, Drinfeld defined an infinite-dimensional Hopf algebra Y(a), 
called the Yangian of a. This Hopf algebra is a deformation of the universal enveloping algebra U(a[z]) of the Lie 
algebra of polynomial loops of a given by explicit generators and relations. The relations can be encoded by 
identities involving a rational /^-matrix. Replacing it with a trigonometric /^-matrix, one arrives at affine quantum 
groups, defined in the same paper of Drinfeld. 

In the case of the general linear Lie algebra gl^ the Yangian admits a simpler description in terms of a single ternary 
(or RTT) relation on the matrix generators due to Faddeev and coauthors. The Yangian Y(gl^ is defined to be the 
algebra generated by elements 1 < i, j < N and p > 0, subject to the relations 



Defining t\ - ^ — Sij , setting 

P >-i 

and introducing the R-matrix R(z) = I + z -1 P on ® C^, where P is the operator permuting the tensor factors, the 
above relations can be written more simply as the ternary relation: 

R 23 {z - w)T 12 {z)T 13 (w) = T 13 (w)T 12 (z)R 23 {z - w). 
The Yangian becomes a Hopf algebra with comultiplication A, counit 8 and antipode s given by 

(A ® id)T(z) = T 12 {z)T 13 (z), (e <g> id)T(z) = /, (s <g> id)T(z) = T(z)-\ 
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At special values of the spectral parameter {z — w), the /^-matrix degenerates to a rank one projection. This can be 

used to define the quantum determinant of T{z) , which generates the center of the Yangian. 

The twisted Yangian Y~(g/ ), introduced by G. I. Olshansky, is the sub-Hopf algebra generated by the coefficients 

of 

S{z)=T(z)aT(-z), 

where a is the involution of gl 2N given by 

a (Eij) — ( — l)* +J ^2JV-j + l J 2JV-i+l- 

Applications to classical representation theory 

G.I. Olshansky and I.Cherednik discovered that the Yangian of g/^is closely related with the branching properties of 
irreducible finite-dimensional representations of general linear algebras. In particular, the classical Gelfand-Tsetlin 
construction of a basis in the space of such a representation has a natural interpretation in the language of Yangians, 
studied by M.Nazarov and V.Tarasov. Olshansky, Nazarov and Molev later discovered a generalization of this 
theory to other classical Lie algebras, based on the twisted Yangian. 

Applications to physics 

Yangian appears as a symmetry group in different models in physics. The most famous one is super- symmetric 
Yang-Mills field in four dimensions. Yangian also appears as a symmetry group of one dimensional exactly solvable 
models such as spin chains, Hubbard model ^ and in models of one dimensional relativistic quantum field theory. 

Representation theory of Yangians 

Irreducible finite-dimensional representations of Yangians were parametrized by Drinfeld in a way similar to the 
highest weight theory in the representation theory of semisimple Lie algebras. The role of the highest weight is 
played by a finite set of Drinfeld polynomials. Drinfeld also discovered a generalization of the classical Schur-Weyl 
duality between representations of general linear and symmetric groups that involves the Yangian of sl N and the 
degenerate affine Hecke algebra (graded Hecke algebra of type A, in George Lusztig's terminology). 

Representations of Yangians have been extensively studied, but the theory is still under active development. 

References 

• Chari, Vyjayanthi; Andrew Pressley (1994). A Guide to Quantum Groups. Cambridge, U.K.: Cambridge 
University Press. ISBN 0-521-55884-0. 

• Drinfel'd, Vladimir Gershonovich (1985). M Ajire6pti Xonc})a h KBaHTOBoe ypaBHemie 5mra-EaKCTepa [Hopf 
algebras and the quantum Yang-Baxter equation]" (in Russian). Doklady Akademii Nauk SSSR 283 (5): 
1060-1064. 

• Drinfel'd, V. G. (1987). "[A new realization of Yangians and of quantum affine algebras]" (in Russian). Doklady 
Akademii Nauk SSSR 296 (1): 13-17. Translated in Soviet Mathematics - Doklady 36 (2): 212-216. 1988. 

• Drinfel'd, V. G. (1986). "Btipo^eHHtie acJxjMHHtie ajire6pti Teicxe h ifflrciaHbi [Degenerate affine Hecke algebras 
and Yangians]" (in Russian). Funktsional'nyi Analiz i Ego Prilozheniya 20 (1): 69-70. MR831053, 

Zbl: 0599.20049. Translated in Drinfel'd, V. G. (1986). "Degenerate affine hecke algebras and Yangians". 
Functional Analysis and Its Applications 20 (1): 58-60. doi:10.1007/BF01077318. 

• Molev, Alexander Ivanovich (2007). Yangians and Classical Lie Algebras. Mathematical Surveys and 
Monographs. Providence, RI: American Mathematical Society. ISBN 978-0-8218-4374-1. 



Yangian 



225 



References 

[1] http://arxiv.org/pdf/hep-th/93 10158 

[2] http://www.mathnet.ru/php/getFT.phtml?jrnid=faa&paperid=1254&volume=20&year=1986 
option_lang=eng 

Quantum spacetime 

In mathematical physics quantum spacetime is the proposal that the actual spacetime that we live in is more 
accurately described not by usual local coordinates j/ 5 z, £ but operator or algebra variables where the order of at 
least some of the products matters. The idea is borrowed from the canonical commutation relations in quantum 
mechanics where position and momentum variables ^,Pare mutually noncommutative, but is postulated now for 
relations between one or more of the spacetime variables themselves. Just as with Heisenberg's uncertainty principle, 
a quantum spacetime necessarily comes with uncertainty relations in the sense that not all :r, y, z, £ can 
simultaneously be ascribed actual numerical values. 

There are fundamental physical reasons to believe that spacetime is better modeled in this way. Because of wave 
particle duality the energy needed to probe smaller and smaller distances would be greater and greater, until the test 
particles doing the probing would form black holes and destroy the very geometry trying to be measured. This means 
that our usual picture of continuum spacetime must itself break down as we approach such Planck scale distances. 
Quantum spacetime is a particular attempt to address this using ideas from quantum mechanics and is plausibly 
expected on the grounds that the corrections to geometry are being induced by quantum gravity. 

The mathematics allowing such a physical possibility has been developed under the general headings of 
noncommutative geometry and quantum geometry. In practice the more well-known Connes approach to 
noncommutative geometry has not been applied so much here and the currently best-known models of quantum 
spacetime fall within a more pedestrian quantum groups approach to noncommutative geometry, based on symmetry. 

It is important to insist that any noncommutative algebra with four generators qualifies properly to be a called 
quantum spacetime. One should require at least the following: 

• There should be a plausible expectation that such an algebra might actually arise in an effective description of 
quantum gravity effects in some regime of that theory. In this context there should be a parameter \ , say, 
controlling the extent of deviation from ordinary spacetime and ultimately identifiable with physical constants 
such as the Planck length. One should obtain ordinary Lorentzian spacetime as \ _ > 0 . 

• Local Lorentz group and Poincare group symmetries should be retained in some sufficient but possibly 
generalised form. These symmetries of ordinary spacetime are needed for the formulation of Special Relativity 
and their generalisation often takes the form of a quantum group acting on the quantum spacetime algebra. 

• There should be a notion of quantum differential calculus on the quantum spacetime algebra, compatible with the 
(quantum) symmetry and preferably reducing to classical high school differential calculus as \ —> Q . 

These and some partial understanding of the rest of the story are the minimum needed to have wave equations for 
particles and fields and hence first predictions for the deviations from classical spacetime physics. This in turn is 
needed if the theory is ever to be tested. 

Several models were found in the 1990s meeting the above criteria to lesser or greater extents. The most important of 
these has currently testable predictions making the search for quantum spacetime not only a theoretical fancy but a 
potentially an actual new discovery about our physical world. 
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Bicrossproduct model spacetime 

Was introduced in 1994 by Shahn Majid and Henri Ruegg^ and has relations 

for spatial variables X{ and the one time variable t • Here \ has dimensions of length and is therefore expected to 
be something like the Planck length. The Poincare group here is correspondingly deformed, now to a certain 
bicrossproduct quantum group with the following characteristic features. 

The momentum generators Pi commute among themselves but 
addition of momenta, reflected in the quantum group structure, is 
deformed (momentum space becomes a non-abelian group). 
Meanwhile, the Lorentz group generators enjoy their usual relations 
among themselves but act non-linearly on the momentum space. The 
orbits for this action are depicted in the figure as a cross-section of Po o 
against one of the Pi . The on- shell region describing particles in the 
upper centre of the image would normally be hyperboloids but these 
are now v squashed up' into the cylinder 




-2-10 1 2 

Orbits for the action of the Lorentz group on 
momentum space in the construction of the 
bicrossproduct model in units of \~ 1 . 
Mass-shell hyperboloids are v squashed' into a 
cylinder. 



in simplified units. The upshot is that as you try to Lorentz-boost the momentum of a particle you will never exceed 
the Planck momentum. The existence of a highest momentum scale or lowest distance scale fits the physical picture. 
The existence of such squashing behaviour comes from the non-linearity of the action and is an endemic feature of 
bicrossproduct quantum groups known since their introduction in 1988 . Some physicists have been so impressed 
by this feature of the bicrossproduct model that they have dubbed it doubly special relativity but such nomenclature 
remains controversial and disputed. 

Another consequence of the squashing is that the propagation of particles is deformed, even of light, leading to a 

variable speed of light prediction. Key to this prediction was a reason to believe that the particular Po?Piare 

plausibly the physical energy and spatial momentum (as opposed to some other function of them), provided in 1999 

T31 

by Giovanni Amelino-Camelia and Majid through a study of plane waves for a quantum differential calculus in the 
model. They take the form 

in other words a form which is sufficiently close to classical that one might plausibly believe the interpretation. At 
the moment such wave analysis represents the best hope to obtain physically testable predictions form the model. 

Prior to this work there were a number of unsupported claims to make predictions from the model based solely on 

the form of the Poincare quantum group. There were also claims based on an earlier ft -Poincare quantum group 

Mi 

introduced by Jurek Lukierski and co-workers which should be viewed as an important precursor to the 
bicrossproduct one, albeit without the actual quantum spacetime and with different proposed generators for which 
the above picture does not apply. The bicrossproduct model spacetime has also been called ft -deformed spacetime 

with ft = A" 1 
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(/-Deformed spacetime 

Was introduced independently by a team^ working under Julius Wess in 1990 and by Majid and coworkers in a 
series of papers on braided matrices starting a year later^ . The point of view in the second approach is that usual 
Minkowski spacetime has a nice description via Pauli matrices as the space of 2 x 2 hermitian matrices. In quantum 
group theory and using braided monoidal category methods one has a natural q- version of this defined here for real 
values of q as a v braided hermitian matrix' of generators and relations 

(7 s) = (7 P a = ^ M] = °' [/3 ' 7] = (i-'TX*-")' ftfl = (l-T 2 )^ 

These relations say that the generators commute as q —> 1 thereby recovering usual Minkowski space. One can 
work with more familiar variables 2, y , z, £ as linear combinations of these. In particular, time 

t = Trace g ^ = q8 + q~ x a 

is given by a natural braided trace of the matrix and commutes with the other generators (so this model has a very 
different flavour from the bicrossproduct one). The braided-matrix picture also leads naturally to a quantity 

^(7 s) 

which as q —> 1 returns us the usual Minkowski distance (this translates to a metric in the quantum differential 
geometry). The parameter q = e X or q = e lA is dimensionless and \ is thought to be a ratio of the Planck scale 

and the cosmological length. That is, there are indications that that this model relates to quantum gravity with 
non-zero cosmological constant, the choice of q depending on whether this is positive or negative. We have 

described the mathematically better understood but perhaps less physically justified positive case here. 
A full understanding of this model requires (and was concurrent with the development of) a full theory of v braided 
linear algebra' for such spaces. The momentum space for the theory is another copy of the same algebra and there is 
a certain "braided addition' of momentum on it expressed as the structure of a braided Hopf algebra or quantum 
group in a certain braided monoidal category). This theory by 1993 had provided the corresponding q -deformed 
Poincare group as generated by such translations and q -Lorentz transformations, completing the interpretation as a 
quantum spacetime . 

In the process it was discovered that the Poincare group not only had to be deformed but had to be extended to 
include dilations of the quantum spacetime. For such a theory to be exact we would need all particles in the theory to 
be massless, which is consistent with experiment as masses of elementary particles are indeed vanishingly small 
compared to the Planck mass. If current thinking in cosmology is correct then this model is more appropriate, but it 
is significantly more complicated and for this reason its physical predictions have yet to be worked out. 



Fuzzy or spin model spacetime 

Refers in modern usage to the angular momentum algebra 

[2:1,2:2] = 2zAx3, [2:2,2:3] = 2iAxi, [2:3,2:1] = 2iXx2 
familiar from quantum mechanics but regarded now as coordinates of a quantum space or spacetime. This is 
primarily of interest as a toy model of quantum gravity where we work only in 3 spacetime dimensions (not the 
correct 4) and here presented with a Euclidean not Lorentzian signature. It was first proposed in this context by 
Geradus 't Hooft while its full development including a quantum differential calculus and an action of a certain 

rm 

v quantum double' quantum group as deformed Euclidean group of motions was obtained by Majid and E. Batista 

A striking feature of the noncommutative geometry here is that the smallest covariant quantum differential calculus 
has one dimension higher than expected, namely 4, suggesting that the above can also be viewed as the spatial part 
of a 4-dimensional quantum spacetime. The model should not be confused with fuzzy spheres which are 
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finite-dimensional matrix algebras which one can think of as spheres in the spin model spacetime of fixed radius. 

Heisenberg model spacetimes 

The first concrete model of quantum spacetime is often attributed to Hartland Snyder in 1947, however his paper'- 10 -' 
actually proposes that 

where generate and are interpreted as the Lorentz group. This is not an actual quantum spacetime in the sense 
above because the spacetime coordinates %fi do not form a self-contained algebra among themselves, rather Snyder 
was proposing a radical unification of spacetime with the Lorentz and Poincare groups. 

The idea was revived in a modern context by Sergio Doplicher, Claus Fredenhagen and John Roberts in 1995 '- 11 -' by 
letting M^v simply be viewed as some function of %fi as defined by the above relation, and any relations involving 
it viewed as higher order relations among the . The Lorentz symmetry is arranged so as to transform the indices 
as usual and without being deformed. 

An even simpler variant of this model is to let M nere be a numerical antisymmetric tensor, in which context it is 
usually denoted Q , so the relations are [x^, xj\ = i6^ u . In even dimensions ]J any nondegenerate such theta 

can be transformed to a normal form in which this really is just the Heisenberg algebra but the difference that the 
variables are being proposed as those of spacetime. This proposal was for a time quite popular because of its familiar 
form of relations and because it has been argued that it emerges from the theory of open strings landing on 
D-branes, see noncommutative quantum field theory and Moyal plane. However, it should be realised that this 
D-brane lives in some of the higher spacetime dimensions in the theory and hence it is not our physical spacetime 
that string theory suggests to be effectively quantum in this way. You also have to subscribe to D-branes as an 
approach to quantum gravity in the first place. Even when posited as quantum spacetime it is hard to obtain physical 
predictions and one reason for this is that if Q is a tensor then by dimensional analysis it should have dimensions of 
length 2 , and if this length is speculated to be the Planck length then the effects would be even harder to ever detect 
than for other models. 

Noncommutative extensions to spacetime 

Although not quantum spacetime in the sense above, another use of noncommutative geometry is to tack on 
v noncommutative extra dimensions' at each point of ordinary spacetime. Instead of invisible curled up extra 
dimensions as in string theory, Alain Connes and coworkers have argued that the coordinate algebra of this extra part 
should be replaced by a finite-dimensional noncommutative algebra. For a certain reasonable choice of this algebra, 
its representation and extended Dirac operator, one is able to recover the Standard Model of elementary particles. In 
this point of view the different kinds of matter particles are manifestations of geometry in these extra 
noncommutative directions. Connes first works here date from 1989 but has been developed considerably since 
then. Such an approach can theoretically be combined with quantum spacetime as above. 
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Quantum gauge theory 

In order to quantize a gauge theory, like for example Yang-Mills theory, Chern-Simons or BF model, one method is 
to perform a gauge fixing. This is done in the BRST and Batalin-Vilkovisky formulation. Another is to factor out the 
symmetry by dispensing with vector potentials altogether (they're not physically observable anyway) and work 
directly with Wilson loops, Wilson lines contracted with other charged fields at its endpoints and spin networks. 

Older approaches to quantization for Abelian models use the Gupta-Bleuler formalism with a "semi-Hilbert space" 
with an indefinite sesquilinear form. However, it is much more elegant to just work with the quotient space of vector 
field configurations by gauge transformations. 

An alternative approach using lattice approximations is covered in (Wick rotated) lattice gauge theory. 

To establish the existence of the Yang-Mills theory and a mass gap is one of the seven Millennium Prize Problems of 
the Clay Mathematics Institute. 
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The standard model of particle physics is a 
theory concerning the electromagnetic, 
weak, and strong nuclear interactions, which 
mediate the dynamics of the known 
subatomic particles. Developed throughout 
the early and middle 20th century, the 
current formulation was finalized in the mid 
1970s upon experimental confirmation of 
the existence of quarks. Since then, 
discoveries of the bottom quark (1977), the 
top quark (1995) and the tau neutrino (2000) 
have given credence to the standard model. 
Because of its success in explaining a wide 
variety of experimental results, the standard 
model is sometimes regarded as a theory of 
almost everything. 

Still, the standard model falls short of being 
a complete theory of fundamental 
interactions because it does not incorporate 
the physics of general relativity, such as 
gravitation and dark energy. The theory 
does not contain any viable dark matter 

particle that possesses all of the required properties deduced from observational cosmology. It also does not correctly 
account for neutrino oscillations (and their non-zero masses). Although the standard model is theoretically 
self-consistent, it has several unnatural properties giving rise to puzzles like the strong CP problem and the hierarchy 
problem. 

Nevertheless, the standard model is important to theoretical and experimental particle physicists alike. For 
theoreticians, the standard model is a paradigm example of a quantum field theory, which exhibits a wide range of 
physics including spontaneous symmetry breaking, anomalies, non-perturbative behavior, etc. It is used as a basis for 
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The Standard Model of elementary particles, with the gauge bosons in the 
rightmost column. 
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building more exotic models which incorporate hypothetical particles, extra dimensions and elaborate symmetries 
(such as supersymmetry) in an attempt to explain experimental results at variance with the standard model such as 
the existence of dark matter and neutrino oscillations. In turn, the experimenters have incorporated the standard 
model into simulators to help search for new physics beyond the standard model from relatively uninteresting 
background. 

Recently, the standard model has found applications in other fields besides particle physics such as astrophysics and 
cosmology, in addition to nuclear physics. 

Historical background 

The first step towards the Standard Model was Sheldon Glashow's discovery, in 1960, of a way to combine the 
electromagnetic and weak interactions J 1] In 1967, Steven Weinberg 1 ^ and Abdus Salam 1 ^ incorporated the Higgs 
mechanism^ ^ ^ into Glashow's electroweak theory, giving it its modern form. 

The Higgs mechanism is believed to give rise to the masses of all the elementary particles in the Standard Model. 
This includes the masses of the W and Z bosons, and the masses of the fermions - i.e. the quarks and leptons. 

After the neutral weak currents caused by Z boson exchange were discovered at CERN in 1973, [7] [8] [9] [10] the 
electroweak theory became widely accepted and Glashow, Salam, and Weinberg shared the 1979 Nobel Prize in 
Physics for discovering it. The W and Z bosons were discovered experimentally in 1981, and their masses were 
found to be as the Standard Model predicted. 

The theory of the strong interaction, to which many contributed, acquired its modern form around 1973-74, when 
experiments confirmed that the hadrons were composed of fractionally charged quarks. 

Overview 

At present, matter and energy are best understood in terms of the kinematics and interactions of elementary particles. 
To date, physics has reduced the laws governing the behavior and interaction of all known forms of matter and 
energy to a small set of fundamental laws and theories. A major goal of physics is to find the "common ground" that 
would unite all of these theories into one integrated theory of everything, of which all the other known laws would 
be special cases, and from which the behavior of all matter and energy could be derived (at least in principle).^ 1 ^ 

The Standard Model groups two major extant theories — quantum electroweak and quantum chromodynamics — into 
an internally consistent theory that describes the interactions between all known particles in terms of quantum field 
theory. For a technical description of the fields and their interactions, see Standard Model (mathematical 
formulation). 

Particle content 
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Organization of Fermions 
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The Standard Model includes 12 elementary particles of spin- known as fermions. According to the spin- statistics 
theorem, fermions respect the Pauli exclusion principle. Each fermion has a corresponding antiparticle. 

The fermions of the Standard Model are classified according to how they interact (or equivalently, by what charges 
they carry). There are six quarks (up, down, charm, strange, top, bottom), and six leptons (electron, electron 
neutrino, muon, muon neutrino, tau, tau neutrino). Pairs from each classification are grouped together to form a 
generation, with corresponding particles exhibiting similar physical behavior (see table). 

The defining property of the quarks is that they carry color charge, and hence, interact via the strong interaction. A 
phenomenon called color confinement results in quarks being perpetually (or at least since very soon after the start of 
the Big Bang) bound to one another, forming color-neutral composite particles (hadrons) containing either a quark 
and an antiquark (mesons) or three quarks (baryons). The familiar proton and the neutron are the two baryons having 
the smallest mass. Quarks also carry electric charge and weak isospin. Hence they interact with other fermions both 
electromagnetically and via the weak nuclear interaction. 

The remaining six fermions do not carry color charge and are called leptons. The three neutrinos do not carry electric 
charge either, so their motion is directly influenced only by the weak nuclear force, which makes them notoriously 
difficult to detect. However, by virtue of carrying an electric charge, the electron, muon, and tau all interact 
electromagnetically. 

Each member of a generation has greater mass than the corresponding particles of lower generations. The first 
generation charged particles do not decay; hence all ordinary (baryonic) matter is made of such particles. 
Specifically, all atoms consist of electrons orbiting atomic nuclei ultimately constituted of up and down quarks. 
Second and third generations charged particles, on the other hand, decay with very short half lives, and are observed 
only in very high-energy environments. Neutrinos of all generations also do not decay, and pervade the universe, but 
rarely interact with baryonic matter. 
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Gauge bosons 

In the Standard Model, gauge bosons 
are force carriers that mediate the 
strong, weak, and electromagnetic 
fundamental interactions. 

Interactions in physics are the ways 
that particles influence other particles. 
At a macroscopic level, 
electromagnetism allows particles to 
interact with one another via electric 
and magnetic fields, and gravitation 
allows particles with mass to attract 
one another in accordance with 
Einstein's general relativity. The 
standard model explains such forces as 
resulting from matter particles 
exchanging other particles, known as 
force mediating particles (Strictly speaking, this is only so if interpreting literally what is actually an approximation 
method known as perturbation theory, as opposed to the exact theory). When a force mediating particle is exchanged, 
at a macroscopic level the effect is equivalent to a force influencing both of them, and the particle is therefore said to 
have mediated (i.e., been the agent of) that force. The Feynman diagram calculations, which are a graphical form of 
the perturbation theory approximation, invoke "force mediating particles" and when applied to analyze high-energy 
scattering experiments are in reasonable agreement with the data. Perturbation theory (and with it the concept of 
"force mediating particle") in other situations fails. These include low-energy QCD, bound states, and solitons. 

The gauge bosons of the Standard Model also all have spin (as do matter particles), but in their case, the value of the 
spin is 1, making them bosons. As a result, they do not follow the Pauli exclusion principle. The different types of 
gauge bosons are described below. 

• Photons mediate the electromagnetic force between electrically charged particles. The photon is massless and is 
well-described by the theory of quantum electrodynamics. 

• The W + , W~, and Z gauge bosons mediate the weak interactions between particles of different flavors (all quarks 
and leptons). They are massive, with the Z being more massive than the W ± . The weak interactions involving the 
W ± act on exclusively left-handed particles and right-handed antiparticles. Furthermore, the W ± carry an electric 
charge of +1 and -1 and couple to the electromagnetic interactions. The electrically neutral Z boson interacts with 
both left-handed particles and antiparticles. These three gauge bosons along with the photons are grouped together 
which collectively mediate the electro weak interactions. 

• The eight gluons mediate the strong interactions between color charged particles (the quarks). Gluons are 

massless. The eightfold multiplicity of gluons is labeled by a combination of color and an anticolor charge (e.g., 
ri2i 

red-antigreen). Because the gluon has an effective color charge, they can interact among themselves. The 
gluons and their interactions are described by the theory of quantum chromodynamics. 

The interactions between all the particles described by the Standard Model are summarized by the diagram at the top 
of this section. 
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Higgs boson 

The Higgs particle is a hypothetical massive scalar elementary particle theorized by Robert Brout, Francois Englert, 
Peter Higgs, Gerald Guralnik, C. R. Hagen, and Tom Kibble in 1964 (see 1964 PRL symmetry breaking papers) and 
is a key building block in the Standard Model J 13 ^ ^ ^ ^ It has no intrinsic spin, and for that reason is classified 
as a boson (like the gauge bosons, which have integer spin). Because an exceptionally large amount of energy and 
beam luminosity are theoretically required to observe a Higgs boson in high energy colliders, it is the only 
fundamental particle predicted by the Standard Model that has yet to be observed. 

The Higgs boson plays a unique role in the Standard Model, by explaining why the other elementary particles, the 
photon and gluon excepted, are massive. In particular, the Higgs boson would explain why the photon has no mass, 
while the W and Z bosons are very heavy. Elementary particle masses, and the differences between 
electromagnetism (mediated by the photon) and the weak force (mediated by the W and Z bosons), are critical to 
many aspects of the structure of microscopic (and hence macroscopic) matter. In electroweak theory, the Higgs 
boson generates the masses of the leptons (electron, muon, and tau) and quarks. 

As yet, no experiment has directly detected the existence of the Higgs boson. It is hoped that the Large Hadron 

Collider at CERN will confirm the existence of this particle. It is also possible that the Higgs boson may already 

ri7i 

have been produced but overlooked. 

Field content 

The standard model has the following fields: 

Spinl 

1. A U(l) gauge field B^ with coupling g' (weak U(l), or weak hypercharge) 

2. An SU(2) gauge field W with coupling g (weak SU(2), or weak isospin) 

3. An SU(3) gauge field with coupling g^ (strong SU(3), or color charge) 

Spin V 2 

The spin V 2 particles are in representations of the gauge groups. For the U(l) group, we list the value of the weak 
hypercharge instead. The left-handed fermionic fields are: 

1. An SU(3) triplet, SU(2) doublet, with U(l) weak hypercharge V (left-handed quarks) 

2. An SU(3) triplet, SU(2) singlet, with U(l) weak hypercharge / (left-handed down-type antiquark) 

3. An SU(3) singlet, SU(2) doublet with U(l) weak hypercharge -1 (left-handed lepton) 

4. An SU(3) triplet, SU(2) singlet, with U(l) weak hypercharge - 4 / 3 (left-handed up-type antiquark) 

5. An SU(3) singlet, SU(2) singlet with U(l) weak hypercharge 2 (left-handed antilepton) 

By CPT symmetry, there is a set of right-handed fermions with the opposite quantum numbers. 

This describes one generation of leptons and quarks, and there are three generations, so there are three copies of each 
field. Note that there are twice as many left-handed lepton field components as left-handed antilepton field 
components in each generation, but an equal number of left-handed quark and antiquark fields. 
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SpinO 

1. An SU(2) doublet H with U(l) hyper-charge -1 (Higgs field) 

2 

Note that \H\ , summed over the two SU(2) components, is invariant under both SU(2) and under U(l), and so it can 
appear as a renormalizable term in the Lagrangian, as can its square. 

This field acquires a vacuum expectation value, leaving a combination of the weak isospin, and weak hypercharge 
unbroken. This is the electromagnetic gauge group, and the photon remains massless. The standard formula for the 
electric charge (which defines the normalization of the weak hypercharge, Y, which would otherwise be somewhat 
arbitrary) is:^ 

Y 

Q = h + T 

Lagrangian 

The Lagrangian for the spin 1 and spin V fields is the most general renormalizable gauge field Lagrangian with no 
fine tunings: 

• Spin 1: 

where the traces are over the SU(2) and SU(3) indices hidden in W and G respectively. The two-index objects are the 
field strengths derived from W and G the vector fields. There are also two extra hidden parameters: the theta angles 
for SU(2) and SU(3). 

The spin-V 2 particles can have no mass terms because there is no right/left helicity pair with the same SU(2) and 
SU(3) representation and the same weak hypercharge. This means that if the gauge charges were conserved in the 
vacuum, none of the spin V particles could ever swap helicity, and they would all be massless. 

For a neutral fermion, for example a hypothetical right-handed lepton N (or N 01 in relativistic two-spinor notation), 
with no SU(3), SU(2) representation and zero charge, it is possible to add the term: 

J MN a N^e af} + N A Nfit&. 

This term gives the neutral fermion a Majorana mass. Since the generic value for M will be of order 1, such a particle 
would generically be unacceptably heavy. The interactions are completely determined by the theory - the leptons 
introduce no extra parameters. 

Higgs mechanism 

The Lagrangian for the Higgs includes the most general renormalizable self interaction: 

5 Higgs = f d*x [(D li H)*(D' l H) + \{\H\ 2 - v 2 f] ■ 

2 

The parameter v has dimensions of mass squared, and it gives the location where the classical Lagrangian is at a 

2 

minimum. In order for the Higgs mechanism to work, v must be a positive number, v has units of mass, and it is the 
only parameter in the standard model which is not dimensionless. It is also much smaller than the Planck scale; it is 
approximately equal to the Higgs mass, and sets the scale for the mass of everything else. This is the only real 
fine-tuning to a small nonzero value in the standard model, and it is called the Hierarchy problem. 

It is traditional to choose the SU(2) gauge so that the Higgs doublet in the vacuum has expectation value (v,0). 
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Masses and CKM matrix 

The rest of the interactions are the most general spin-0 spin- 1 /^ Yukawa interactions, and there are many of these. 
These constitute most of the free parameters in the model. The Yukawa couplings generate the masses and mixings 
once the Higgs gets its vacuum expectation value. 

The terms L HR generate a mass term for each of the three generations of leptons. There are 9 of these terms, but by 
relabeling L and R, the matrix can be diagonalized. Since only the upper component of H is nonzero, the upper 
SU(2) component of L mixes with R to make the electron, the muon, and the tau, leaving over a lower massless 
component, the neutrino. {Neutrino oscillation show neutrinos have mass, http://operaweb.lngs.infn.it/spip. 
php?rubriquel4 31May2010 Press Release.} 

The terms QHU generate up masses, while QHD generate down masses. But since there is more than one 
right-handed singlet in each generation, it is not possible to diagonalize both with a good basis for the fields, and 
there is an extra CKM matrix. 

Theoretical aspects 

Construction of the Standard Model Lagrangian 



Parameters of the Standard Model 



Symbol 


Description 


Renormalization 
scheme (point) 


Value 


m 

e 


Electron mass 




511 keV 


m 

\i 


Muon mass 




105.7 MeV 


m 

X 


Tau mass 




1.78 GeV 


m 

u 


Up quark mass 


"MS = 2GeV 


1.9 MeV 


m 

d 


Down quark mass 


"MS = 2GeV 


4.4 MeV 


m 

s 


Strange quark mass 


"MS = 2GeV 


87 MeV 


m 

c 


Charm quark mass 


"MS = m c 


1.32 GeV 


m, 
b 


Bottom quark mass 


"MS = m b 


4.24 GeV 


m 

t 


Top quark mass 


On-shell scheme 


172.7 GeV 


9 n 


CKM 12-mixing angle 




13.1° 


9 23 


CKM 23 -mixing angle 




2.4° 


°n 


CKM 13-mixing angle 




0.2° 


6 


CKM CP-violating Phase 




0.995 


h 


U(l) gauge coupling 


"MS = m z 


0.357 


8 2 


SU(2) gauge coupling 


"MS = m z 


0.652 


8 3 


SU(3) gauge coupling 


"MS = m z 


1.221 


e 

QCD 


QCD vacuum angle 




~0 


n 


Higgs quadratic coupling 




Unknown 




Higgs self-coupling strength 




Unknown 



Technically, quantum field theory provides the mathematical framework for the standard model, in which a 
Lagrangian controls the dynamics and kinematics of the theory. Each kind of particle is described in terms of a 
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dynamical field that pervades space-time. The construction of the standard model proceeds following the modern 
method of constructing most field theories: by first postulating a set of symmetries of the system, and then by writing 
down the most general renormalizable Lagrangian from its particle (field) content that observes these symmetries. 

The global Poincare symmetry is postulated for all relativistic quantum field theories. It consists of the familiar 
translational symmetry, rotational symmetry and the inertial reference frame in variance central to the theory of 
special relativity. The local SU(3)xSU(2)xU(l) gauge symmetry is an internal symmetry that essentially defines the 
standard model. Roughly, the three factors of the gauge symmetry give rise to the three fundamental interactions. 
The fields fall into different representations of the various symmetry groups of the Standard Model (see table). Upon 
writing the most general Lagrangian, one finds that the dynamics depend on 19 parameters, whose numerical values 
are established by experiment. The parameters are summarized in the table at right. 

The QCD sector 

The QCD sector defines the interactions between quarks and gluons, with SU(3) symmetry, generated by T a . Since 
leptons do not interact with gluons, they are not affected by this sector. 

C-qcd = U(d» - ig s G°T a )rU + D{d» - ig^^-fD. 

is the gluon field strength, 7^ are the Dirac matrices, D stands for the isospin doublet section, U stands for a 
unitary matrix, and g § is the strong coupling constant. 

The electroweak sector 

The electroweak sector is a Yang-Mills gauge theory with the symmetry group U(l)xSU(2) L , 



1/1 ^ ' 



where is the U(l) gauge field; is the weak hypercharge — the generator of the U(l) group; W^is the 

three-component SU(2) gauge field; 7^ are the Pauli matrices — infinitesimal generators of the SU(2) group. The 
subscript L indicates that they only act on left fermions; g' and g are coupling constants. 

The Higgs sector 

In the Standard Model, the Higgs field is a complex spinor of the group SU(2) L : 



where the indexes + and 0 indicate the electric charge (Q) of the components. The weak isospin (Y^) of both 
components is 1. 

Before symmetry breaking, the Higgs Lagrangian is: 

A 2 



£h = (d, - \ (g'YwB, + grW,) ){%+\ (<7%^ + 9^) ) <P ~ T faV - , 



which can also be written as: 



d^ + ^g'YwB^ + grW^tp 



v2 



X 2 
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Additional symmetries of the Standard Model 

From the theoretical point of view, the Standard Model exhibits four additional global symmetries, not postulated at 
the outset of its construction, collectively denoted accidental symmetries, which are continuous U(l) global 
symmetries. The transformations leaving the Lagrangian invariant are: 

V q (z) -> e-/ 3 ^ 

E L -> e if, E L and (e R ) c -> e ip (e R ) c 
M L -> M L and -> e^(^) c 

T L -> e^T L and (t*) c -> e^(r fl ) c . 
The first transformation rule is shorthand meaning that all quark fields for all generations must be rotated by an 
identical phase simultaneously. The fields , and (fJ,R) c , (r^) c are the 2nd (muon) and 3rd (tau) 
generation analogs of i^and (e^) c fields. 

By Noether's theorem, each symmetry above has an associated conservation law: the conservation of baryon number, 
electron number, muon number, and tau number. Each quark is assigned a baryon number of 1/3, while each 
antiquark is assigned a baryon number of -1/3. Conservation of baryon number implies that the number of quarks 
minus the number of antiquarks is a constant. Within experimental limits, no violation of this conservation law has 
been found. 

Similarly, each electron and its associated neutrino is assigned an electron number of +1, while the antielectron and 
the associated antineutrino carry -1 electron number. Similarly, the muons and their neutrinos are assigned a muon 
number of +1 and the tau leptons are assigned a tau lepton number of +1. The Standard Model predicts that each of 
these three numbers should be conserved separately in a manner similar to the way baryon number is conserved. 
These numbers are collectively known as lepton family numbers (LF). Symmetry works differently for quarks than 
for leptons, mainly because the Standard Model predicts that neutrinos are massless. However, it was recently found 
that neutrinos have small masses and oscillate between flavors, signaling that the conservation of lepton family 
number is violated. 

In addition to the accidental (but exact) symmetries described above, the Standard Model exhibits several 
approximate symmetries. These are the M SU(2) custodial symmetry" and the M SU(2) or SU(3) quark flavor 
symmetry." 



Symmetries of the Standard Model and Associated Conservation Laws 



Symmetry 


Lie Group 


Symmetry Type 


Conservation Law 


Poincare 


TranslationsxSO(3,l) 


Global symmetry 


Energy, Momentum, Angular momentum 


Gauge 


SU(3)xSU(2)xU(l) 


Local symmetry 


Color charge, Weak isospin, Electric charge, Weak hypercharge 


Baryon phase 


U(l) 


Accidental Global symmetry 


Baryon number 


Electron phase 


U(l) 


Accidental Global symmetry 


Electron number 


Muon phase 


U(l) 


Accidental Global symmetry 


Muon number 


Tau phase 


U(l) 


Accidental Global symmetry 


Tau number 
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Field content of the Standard Model 



Field 
(1st generation) 


Spin 


Gauge group 
Representation 


Bar yon 
Number 


Electron 
Number 


Left-handed quark 


Ql 


1/2 


(3, 2, +1/3) 


1/3 


0 


Left-handed up antiquark 


Ut = (uviY 
""Li — V it / 


1/2 


(3, 1, -4/3) 


-1/3 


0 


Left-handed down antiquark 




1/2 


(3, 1.+2/3) 


-1/3 


0 


Left-handed lepton 




1/2 


(1,2, -1) 


0 


1 


Left-handed antielectron 


§l = (e R ) c 


1/2 


(1, l,+2) 


0 


-1 


Hypercharge gauge field 




1 


( 1, 1, 0) 


0 


0 


Isospin gauge field 


w. 


1 


(1,3,0) 


0 


0 


Gluon field 




1 


(8, 1, 0) 


0 


0 


Higgs field 


H 


0 


(1,2,+1) 


0 


0 



List of standard model fermions 

This table is based in part on data gathered by the Particle Data Group J 19 ^ 

Left-handed fermions in the Standard Model 



Generation 1 


Fermion 
(left-handed) 


Symbol 


Electric 
charge 


Weak 
isospin 


Weak 
hypercharge 


Color 
charge * 


Mass ** 


Electron 


e~ 


-1 


-1/2 


-1 


1 


511 keV 


Positron 


e + 


+ 1 


0 


+2 


1 


511 keV 


Electron neutrino 


v e 


0 


+1/2 


-1 


1 


< 2 eV **** 


Electron antineutrino 


i> e 


0 


0 


0 


1 


< 2 eV **** 


Up quark 


u 


+2/3 


+1/2 


+1/3 


3 


~ 3 MeV *** 


Up antiquark 


u 


-2/3 


0 


-4/3 


3 


~ 3 MeV *** 


Down quark 


d 


-1/3 


-1/2 


+1/3 


3 


~6MeV*** 


Down antiquark 


d 


+1/3 


0 


+2/3 


3 


~6MeV*** 




Generation 2 


Fermion 
(left-handed) 


Symbol 


Electric 
charge 


Weak 
isospin 


Weak 
hypercharge 


Color 
charge * 


Mass ** 


Muon 




-1 


-1/2 


-1 


1 


106 MeV 


Antimuon 




+ 1 


0 


+2 


1 


106 MeV 


Muon neutrino 




0 


+1/2 


-1 


1 


< 2 eV **** 


Muon antineutrino 




0 


0 


0 


1 


< 2 eV **** 


Charm quark 


c 


+2/3 


+1/2 


+1/3 


3 


~ 1.337 GeV 


Charm antiquark 


c 


-2/3 


0 


-4/3 


3 


~ 1.3 GeV 


Strange quark 


s 


-1/3 


-1/2 


+1/3 


3 


~ 100 MeV 


Strange antiquark 


s 


+1/3 


0 


+2/3 


3 


~ 100 MeV 
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Generation 3 


Fermion 
(left-handed) 


Symbol 


Electric 
charge 


Weak 
isospin 


Weak 
hypercharge 


Color 
charge * 


Mass ** 


Tau 


T 


-1 


-1/2 


-1 


1 


1.78 GeV 


Antitau 


7 


+ 1 


0 


+2 


1 


1.78 GeV 


Tau neutrino 


V T 


0 


+1/2 


-1 


1 


< 2 eV **** 


Tau antineutrino 


V T 


0 


0 


0 


1 


< 2 eV **** 


Top quark 


t 


+2/3 


+1/2 


+1/3 


3 


171 GeV 


Top antiquark 


I 


-2/3 


0 


-4/3 


3 


171 GeV 


Bottom quark 


b 


-1/3 


-1/2 


+1/3 


3 


~ 4.2 GeV 


Bottom antiquark 


b 


+1/3 


0 


+2/3 


3 


~ 4.2 GeV 


Notes: 














• * These are not ordinary abelian charges, which can be added together, but are labels of group representations of Lie groups. 

• ** Mass is really a coupling between a left-handed fermion and a right-handed fermion. For example, the mass of an electron is really a 
coupling between a left-handed electron and a right-handed electron, which is the antiparticle of a left-handed positron. Also neutrinos show 
large mixings in their mass coupling, so it's not accurate to talk about neutrino masses in the flavor basis or to suggest a left-handed electron 
antineutrino. 

• *** The masses of baryons and hadrons and various cross-sections are the experimentally measured quantities. Since quarks can't be isolated 
because of QCD confinement, the quantity here is supposed to be the mass of the quark at the renormalization scale of the QCD scale. 

• **** The Standard Model assumes that neutrinos are massless. However, several contemporary experiments prove that neutrinos oscillate 
between their flavour states, which could not happen if all were massless. It is straightforward to extend the model to fit these data but there 
are many possibilities, so the mass eigenstates are still open. See Neutrino#Mass. 



Tests and predictions 

The Standard Model (SM) predicted 
the existence of the W and Z bosons, 
gluon, and the top and charm quarks ' 
before these particles were observed. 
Their predicted properties were 
experimentally confirmed with good 
precision. To give an idea of the 

success of the SM, the following table compares the measured masses of the W and Z bosons with the masses 
predicted by the SM: 



|e 










: 

T 




n-p n - TT 


Li 


□ 

n 


1 j 


D 

lit 


',0 


tfl 

<*> 



Log plot of masses in the Standard Model. 



Quantity 


Measured (GeV) 


SM prediction (GeV) 


Mass of W boson 


80.398 ± 0.025 


80.390 ±0.018 


Mass of Z boson 


91.1876 ±0.0021 


91.1874 ±0.0021 



The SM also makes several predictions about the decay of Z bosons, which have been experimentally confirmed by 
the Large Electron-Positron Collider at CERN. 
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Challenges to the standard model 

There is some experimental evidence consistent with neutrinos having mass, which the Standard Model does not 
allow. To accommodate such findings, the Standard Model can be modified by adding a non-renormalizable 
interaction of lepton fields with the square of the Higgs field. This is natural in certain grand unified theories, and if 
new physics appears at about 10 16 GeV, the neutrino masses are of the right order of magnitude. 

Currently, there is one elementary particle predicted by the Standard Model that has yet to be observed: the Higgs 
boson. A major reason for building the Large Hadron Collider is that the high energies of which it is capable are 
expected to make the Higgs observable. However, as of August 2008, there is only indirect empirical evidence for 
the existence of the Higgs boson, so that its discovery cannot be claimed. Moreover, there are serious theoretical 
reasons for supposing that elementary scalar Higgs particles cannot exist (see Quantum triviality). 

A fair amount of theoretical and experimental research has attempted to extend the Standard Model into a Unified 
Field Theory or a Theory of everything, a complete theory explaining all physical phenomena including constants. 
Inadequacies of the Standard Model that motivate such research include: 

• It does not attempt to explain gravitation, and unlike for the strong and electroweak interactions of the Standard 
Model, there is no known way of describing general relativity, the canonical theory of gravitation, consistently in 
terms of quantum field theory. The reason for this is among other things that quantum field theories of gravity 
generally break down before reaching the Planck scale. As a consequence, we have no reliable theory for the very 
early universe; 

• It seems rather ad-hoc and inelegant, requiring 19 numerical constants whose values are unrelated and arbitrary. 
Although the Standard Model, as it now stands, can explain why neutrinos have masses, the specifics of neutrino 
mass are still unclear. It is believed that explaining neutrino mass will require an additional 7 or 8 constants, 
which are also arbitrary parameters; 

• The Higgs mechanism gives rise to the hierarchy problem if any new physics (such as quantum gravity) is present 
at high energy scales. In order for the weak scale to be much smaller than the Planck scale, severe fine tuning of 
Standard Model parameters is required; 

• It should be modified so as to be consistent with the emerging "standard model of cosmology." In particular, the 
Standard Model cannot explain the observed amount of cold dark matter (CDM) and gives contributions to dark 
energy which are far too large. It is also difficult to accommodate the observed predominance of matter over 
antimatter (matter/antimatter asymmetry). The isotropy and homogeneity of the visible universe over large 
distances seems to require a mechanism like cosmic inflation, which would also constitute an extension of the 
Standard Model. 

Currently no proposed Theory of everything has been conclusively verified. 
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Topological quantum field theory 



A topological quantum field theory (or topological field theory or TQFT) is a quantum field theory which 
computes topological invariants. 

Although TQFTs were invented by physicists, they are also of mathematical interest, being related to, among other 
things, knot theory and the theory of four-manifolds in algebraic topology, and to the theory of moduli spaces in 
algebraic geometry. Donaldson, Jones, Witten, and Kontsevich have all won Fields Medals for work related to 
topological field theory. 

In condensed matter physics, topological quantum field theories are the low energy effective theories of 
topologically ordered states, such as fractional quantum Hall states, string-net condensed states, and other strongly 
correlated quantum liquid states. 



In a topological field theory, the correlation functions do not depend on the metric on spacetime. This means that the 
theory is not sensitive to changes in the shape of spacetime; if the spacetime warps or contracts, the correlation 
functions do not change. Consequently, they are topological invariants. 

Topological field theories are not very interesting on the flat Minkowski spacetime used in particle physics. 
Minkowski space can be contracted to a point, so a TQFT on Minkowski space computes only trivial topological 
invariants. Consequently, TQFTs are usually studied on curved spacetimes, such as, for example, Riemann surfaces. 
Most of the known topological field theories are defined on spacetimes of dimension less than five. It seems that a 
few higher dimensional theories exist, but they are not very well understood. 

Quantum gravity is believed to be background-independent (in some suitable sense), and TQFTs provide examples 
of background independent quantum field theories. This has prompted ongoing theoretical investigation of this class 
of models. 

(Caveat: It is often said that TQFTs have only finitely many degrees of freedom. This is not a fundamental property. 
It happens to be true in most of the examples that physicists and mathematicians study, but it is not necessary. A 
topological sigma model with target infinite-dimensional projective space, if such a thing could be defined, would 
have countably infinitely many degrees of freedom.) 

Specific models 

The known topological field theories fall into two general classes: Schwarz-type TQFTs and Witten-type TQFTs. 
Witten TQFTs are also sometimes referred to as cohomological field theories. 

Schwarz-type TQFTs 

In Schwarz-type TQFTs, the correlation functions computed by the path integral are topological invariants because 
the path integral measure and the quantum field observables are explicitly independent of the metric. For instance, in 
the BF model, the spacetime is a two-dimensional manifold M, the observables are constructed from a two-form F, 
an auxiliary scalar B, and their derivatives. The action (which determines the path integral) is 



Jm 

The spacetime metric does not appear anywhere in this theory, so the theory is explicitly topologically invariant. 
Another, more famous example is Chern-Simons theory, which can be used to compute knot invariants. 



Overview 
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Witten-type TQFTs 

In Witten-type topological field theories, the topological in variance is more subtle. For example the Lagrangian for 
the WZW model does depend explicitly on the metric, but one shows by calculation that the expectation value of the 
partition function and a special class of correlation functions are in fact diffeomorphism invariant. 

Mathematical formulations 

Atiyah-Segal axioms 

Atiyah suggested a set of axioms for topological quantum field theory which was inspired by Segal's proposed 
axioms for conformal field theory, (Atiyah 1988). These axioms have been relatively useful for mathematical 
treatments of Schwarz-type QFTs, although it isn't clear that they capture the whole structure of Witten-type QFTs. 
The basic idea is that a TQFT is a functor from a certain category of cobordisms to the category of vector spaces. 

There are in fact two different sets of axioms which could reasonably be called the Atiyah axioms. These axioms 
differ basically in whether or not they study a TQFT defined on a single fixed ^-dimensional Riemannian / 
Lorentzian spacetime Mora TQFT defined on all ^-dimensional spacetimes at once. 

[ed. What follows is still in rough draft form and should be regarded suspiciously.] 

The case of a fixed spacetime 

Let BordM^e the category whose morphisms are ^-dimensional submanifolds of M and whose objects are 
connected components of the boundaries of such submanifolds. Regard two morphisms as equivalent if they are 
homotopic via submanifolds of M, and so form the quotient category hBordM : The objects in hB or dj^ are the 
objects of Bordjtf' an d the morphisms of /^Bord/^ are homotopy equivalence classes of morphisms in BordM 

A TQFT on M is a symmetric monoidal functor from HBordM^ 0 me category of vector spaces. 
Note that cobordisms can, if their boundaries match up, be sewn together to form a new bordism. This is the 
composition law for morphisms in the cobordism category. Since functors are required to preserve composition, this 
says that the linear map corresponding to a sewn together morphism is just the composition of the linear map for 
each piece. 

There is an equivalence of categories between the category of 2-dimensional topological quantum field theories and 
the category of commutative Frobenius algebras. 

All n-dimensional spacetimes at once 

To consider all spacetimes at once, it is necessary to replace 
hBordM^y a larger category. So let Bord n be the category of 
bordisms, i.e. the category whose morphisms are n-dimensional 
manifolds with boundary, and whose objects are the connected 
components of the boundaries of n-dimensional manifolds. (Note that 
any (n — 1) -dimensional manifold may appear as an object in 
Bord n .) As above, regard two morphisms in Bord n as equivalent 
if they are homotopic, and form the quotient category hBord n . 
Bord n is a monoidal category under the operation which takes two 
bordisms to the bordism made from their disjoint union. A TQFT on 
n-dimensional manifolds is then a functor from hBord n to the 
category of vector spaces, which takes disjoint unions of bordisms to 
the tensor product f [ed. unfinished] 




The pair of pants is a (l+l)-dimensional bordism, 
which corresponds to a product or coproduct in a 
2-dimensional TQFT. 
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For example, for (l+l)-dimensional bordisms (2-dimensional bordisms between 1 -dimensional manifolds), the map 
associated with a pair of pants gives a product or coproduct, depending on how the boundary components are 
grouped - which is commutative or cocommutative, while the map associated with a disk gives a counit (trace) or 
unit (scalars), depending on grouping of boundary, and thus (l+l)-dimension TQFTs correspond to Frobenius 
algebras. 

Generalizations 

For some applications, it is convenient to demand extra topological structure on the morphisms, such as a choice of 
orientation. 
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Quantum Chromodynamics 

In theoretical physics, quantum chromodynamics (QCD) is a theory of the strong interaction (color force), a 
fundamental force describing the interactions of the quarks and gluons making up hadrons (such as the proton, 
neutron or pion). It is the study of the SU(3) Yang-Mills theory of color-charged fermions (the quarks). QCD is a 
quantum field theory of a special kind called a non-abelian gauge theory. It is an important part of the Standard 
Model of particle physics. A huge body of experimental evidence for QCD has been gathered over the years. 

QCD enjoys two peculiar properties: 

• Confinement, which means that the force between quarks does not diminish as they are separated. Because of 
this, it would take an infinite amount of energy to separate two quarks; they are forever bound into hadrons such 
as the proton and the neutron. Although analytically unproven, confinement is widely believed to be true because 
it explains the consistent failure of free quark searches, and it is easy to demonstrate in lattice QCD. 

• Asymptotic freedom, which means that in very high-energy reactions, quarks and gluons interact very weakly. 
This prediction of QCD was first discovered in the early 1970s by David Politzer and by Frank Wilczek and 
David Gross. For this work they were awarded the 2004 Nobel Prize in Physics. 

There is no known phase-transition line separating these two properties; confinement is dominant in low-energy 
scales but, as energy increases, asymptotic freedom becomes dominant. 

Terminology 

The word quark was coined by American physicist Murray Gell-Mann (b. 1929) in its present sense. It originally 
comes from the phrase "Three quarks for Muster Mark" in Finnegans Wake by James Joyce. On June 27, 1978, 
Gell-Mann wrote a private letter to the editor of the Oxford English Dictionary, in which he related that he had been 
influenced by Joyce's words: "The allusion to three quarks seemed perfect." (Originally, only three quarks had been 
discovered.) Gell-Mann, however, wanted to pronounce the word with (6) not (a), as Joyce seemed to indicate by 
rhyming words in the vicinity such as Mark. Gell-Mann got around that "by supposing that one ingredient of the line 
'Three quarks for Muster Mark' was a cry of 'Three quarts for Mister . . . ' heard in H.C. Earwicker's pub," a plausible 
suggestion given the complex punning in Joyce's novel. ^ 

The three kinds of charge in QCD (as opposed to one in quantum electrodynamics or QED) are usually referred to as 
"color charge" by loose analogy to the three kinds of color (red, green and blue) perceived by humans. Other than 
this "clever" nomenclature, the quantum parameter "color" is completely unrelated to the everyday, familiar 
phenomenon of color. 

Since the theory of electric charge is dubbed "electrodynamics", the Greek word "chroma" Xpcojia (meaning color) 
is applied to the theory of color charge, "chromodynamics". 

History 

With the invention of bubble chambers and spark chambers in the 1950s, experimental particle physics discovered a 
large and ever-growing number of particles called hadrons. It seemed that such a large number of particles could not 
all be fundamental. First, the particles were classified by charge and isospin by Eugene Wigner and Werner 
Heisenberg; then, in 1953, according to strangeness by Murray Gell-Mann and Kazuhiko Nishijima. To gain greater 
insight, the hadrons were sorted into groups having similar properties and masses using the eightfold way, invented 
in 1961 by Gell-Mann and Yuval Ne'eman. Gell-Mann and George Zweig, correcting an earlier approach of Shoichi 
Sakata, went on to propose in 1963 that the structure of the groups could be explained by the existence of three 
flavours of smaller particles inside the hadrons: the quarks. 
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Perhaps the first remark that quarks should possess an additional quantum number was made as a short footnote in 
the preprint of Boris Struminsky 1 in connection with Q- hyperon composed of three strange quarks with parallel 
spins (this situation was peculiar, because since quarks are fermions, such combination is forbidden by the Pauli 
exclusion principle): 

Three identical quarks cannot form an antisymmetric S-state. In order to realize an antisymmetric orbital 
S -state, it is necessary for the quark to have an additional quantum number. 

- B. V. Struminsky, Magnetic moments ofbarions in the quark model, JINR-Preprint P-1939, Dubna, 

Submitted on January 7, 1965 

Boris Struminsky was a PhD student of Nikolay Bogolyubov. The problem considered in this preprint was suggested 

by Nikolay Bogolyubov, who advised Boris Struminsky in this research. In the beginning of 1965, Nikolay 

Bogolyubov, Boris Struminsky and Albert Tavchelidze wrote a preprint with a more detailed discussion of the 

Mi 

additional quark quantum degree of freedom. This work was also presented by Albert Tavchelidze without 
obtaining consent of his collaborators for doing so at an international conference in Trieste (Italy), in May 1965 ^ 

A similar mysterious situation was with the A ++ baryon; in the quark model, it is composed of three up quarks with 
parallel spins. In 1965, Moo- Young Han with Yoichiro Nambu and Oscar W. Greenberg independently resolved the 
problem by proposing that quarks possess an additional SU(3) gauge degree of freedom, later called color charge. 
Han and Nambu noted that quarks might interact via an octet of vector gauge bosons: the gluons. 

Since free quark searches consistently failed to turn up any evidence for the new particles, and because an 
elementary particle back then was defined as a particle which could be separated and isolated, Gell-Mann often said 
that quarks were merely convenient mathematical constructs, not real particles. The meaning of this statement was 
usually clear in context: He meant quarks are confined, but he also was implying that the strong interactions could 
probably not be fully described by quantum field theory. 

Richard Feynman argued that high energy experiments showed quarks are real particles: he called them partons 
(since they were parts of hadrons). By particles, Feynman meant objects which travel along paths, elementary 
particles in a field theory. 

The difference between Feynman's and Gell-Mann's approaches reflected a deep split in the theoretical physics 
community. Feynman thought the quarks have a distribution of position or momentum, like any other particle, and 
he (correctly) believed that the diffusion of parton momentum explained diffractive scattering. Although Gell-Mann 
believed that certain quark charges could be localized, he was open to the possibility that the quarks themselves 
could not be localized because space and time break down. This was the more radical approach of S-matrix theory. 

James Bjorken proposed that pointlike partons would imply certain relations should hold in deep inelastic scattering 
of electrons and protons, which were spectacularly verified in experiments at SLAC in 1969. This led physicists to 
abandon the S-matrix approach for the strong interactions. 

The discovery of asymptotic freedom in the strong interactions by David Gross, David Politzer and Frank Wilczek 
allowed physicists to make precise predictions of the results of many high energy experiments using the quantum 
field theory technique of perturbation theory. Evidence of gluons was discovered in three jet events at PETRA in 
1979. These experiments became more and more precise, culminating in the verification of perturbative QCD at the 
level of a few percent at the LEP in CERN. 

The other side of asymptotic freedom is confinement. Since the force between color charges does not decrease with 
distance, it is believed that quarks and gluons can never be liberated from hadrons. This aspect of the theory is 
verified within lattice QCD computations, but is not mathematically proven. One of the Millennium Prize Problems 
announced by the Clay Mathematics Institute requires a claimant to produce such a proof. Other aspects of 
non-perturbative QCD are the exploration of phases of quark matter, including the quark-gluon plasma. 

The relation between the short-distance particle limit and the confining long-distance limit is one of the topics 
recently explored using string theory, the modern form of S-matrix theory ^ 
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Theory 

Some definitions 

Every field theory of particle physics is based on certain symmetries of nature whose existence is deduced from 
observations. These can be 

• local symmetries, that is the symmetry acts independently at each point in space-time. Each such symmetry is the 
basis of a gauge theory and requires the introduction of its own gauge bosons. 

• global symmetries, which are symmetries whose operations must be simultaneously applied to all points of 
space-time. 

QCD is a gauge theory of the SU(3) gauge group obtained by taking the color charge to define a local symmetry. 

Since the strong interaction does not discriminate between different flavors of quark, QCD has approximate flavor 
symmetry, which is broken by the differing masses of the quarks. 

There are additional global symmetries whose definitions require the notion of chirality, discrimination between left 
and right-handed. If the spin of a particle has a positive projection on its direction of motion then it is called 
left-handed; otherwise, it is right-handed. Chirality and handedness are not the same, but become approximately 
equivalent at high energies. 

• Chiral symmetries involve independent transformations of these two types of particle. 

• Vector symmetries (also called diagonal symmetries) mean the same transformation is applied on the two 
chiralities. 

• Axial symmetries are those in which one transformation is applied on left-handed particles and the inverse on the 
right-handed particles. 

Additional remarks: duality 

As mentioned, asymptotic freedom means that at large energy - this corresponds also to short distances - there is 
practically no interaction between the particles. This is in contrast - more precisely one would say: dual - to what one 
is used to, since usually one connects the absence of interactions with large distances. However, as already 
mentioned in the original paper of Franz WegnerJ 9 ^ a solid state theorist who introduced 1971 simple gauge 
invariant lattice models, the high-temperature behaviour of the original model, e.g. the strong decay of correlations 
at large distances, corresponds to the low-temperature behaviour of the (usually ordered!) dual model, namely the 
asymptotic decay of non-trivial correlations, e.g. short-range deviations from almost perfect arrangements, for short 
distances. Here, in contrast to Wegner, we have only the dual model, which is that one described in this article/ 10 ^ 

Symmetry groups 

The color group SU(3) corresponds to the local symmetry whose gauging gives rise to QCD. The electric charge 
labels a representation of the local symmetry group U(l) which is gauged to give QED: this is an abelian group. If 
one considers a version of QCD with N f flavors of massless quarks, then there is a global (chiral) flavor symmetry 
group SU L (N f ) X SU R (N f ) X U B (1) X U A (1). The chiral symmetry is spontaneously broken by the QCD 
vacuum to the vector (L+R) SUy (Nf) with the formation of a chiral condensate. The vector symmetry, Uq(1) 
corresponds to the baryon number of quarks and is an exact symmetry. The axial symmetry U^(l)is exact in the 
classical theory, but broken in the quantum theory, an occurrence called an anomaly. Gluon field configurations 
called instantons are closely related to this anomaly. 

There are two different types of SU(3) symmetry: there is the symmetry that acts on the different colors of quarks, 
and this is an exact gauge symmetry mediated by the gluons, and there is also a flavor symmetry which rotates 
different flavors of quarks to each other, or flavor SU(3). Flavor SU(3) is an approximate symmetry of the vacuum of 
QCD, and is not a fundamental symmetry at all. It is an accidental consequence of the small mass of the three 
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lightest quarks. 

In the QCD vacuum there are vacuum condensates of all the quarks whose mass is less than the QCD scale. This 
includes the up and down quarks, and to a lesser extent the strange quark, but not any of the others. The vacuum is 
symmetric under SU(2) isospin rotations of up and down, and to a lesser extent under rotations of up, down and 
strange, or full flavor group SU(3), and the observed particles make isospin and SU(3) multiplets. 

The approximate flavor symmetries do have associated gauge bosons, observed particles like the rho and the omega, 
but these particles are nothing like the gluons and they are not massless. They are emergent gauge bosons in an 
approximate string description of QCD. 

Lagrangian 

The dynamics of the quarks and gluons are controlled by the quantum chromodynamics Lagrangian. The gauge 
invariant QCD Lagrangian is 

£ QCD = * tyfiP^ - m <y 4, - Jg^gt 

where ^(x) is the quark field, a dynamical function of space-time, in the fundamental representation of the SU(3) 
gauge group, indexed by i, j 5 . . .; G^(:r)are the gluon fields, also a dynamical function of space-time, in the 
adjoint representation of the SU(3) gauge group, indexed by a, 6, ... . The 7^ are Dirac matrices connecting the 
spinor representation to the vector representation of the Lorentz group; and Tjj are the generators connecting the 

fundamental, antifundamental and adjoint representations of the SU(3) gauge group. The Gell-Mann matrices 
provide one such representation for the generators. 

The symbol G^ v represents the gauge invariant gluonic field strength tensor, analogous to the electromagnetic field 
strength tensor, , in Electrodynamics. It is given by 

G% = d^G* - d v G; - gf^Gffi , 
where f abc are the structure constants of SU(3). Note that the rules to move-up or pull-down the a, b, or c indexes 

are trivial, (+ +), so that f abc = f abc = f£ c } whereas for the fi or v indexes one has the non-trivial relativistic 

rules, corresponding e.g. to the signature (+ — ). Furthermore, for mathematicians, according to this formula the 
gluon colour field can be represented by a SU(3)-Lie algebra-valued " curvature M -2-form G = dG — g G A G , 

where G* s a " vector potential"- 1 -form corresponding to G an d A is the (antisymmetric) "wedge product" of this 
algebra, producing the "structure constants" f abc . The Cartan-derivative of the field form (i.e. essentially the 
divergence of the field) would be zero in the absence of the "gluon terms", i.e. those ~ g, which represent the 

*Kie cons^fanfs^m ari§ °<? contrH^me quark mass and coupling constants of the theory, subject to renormalization in 
the full quantum theory. 

An important theoretical notion concerning the final term of the above Lagrangian is the Wilson loop variable. This 
loop variable plays a most-important role in discretized forms of the QCD (see lattice QCD), and more generally, it 
distinguishes confined and deconfined states of a gauge theory. It was introduced by the Nobel prize winner Kenneth 
G. Wilson and is treated in a separate article. 
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Fields 

Quarks are massive spin- 1/2 fermions which carry a color charge whose gauging is the content of QCD. Quarks are 
represented by Dirac fields in the fundamental representation 3 of the gauge group SU(3). They also carry electric 
charge (either -1/3 or 2/3) and participate in weak interactions as part of weak isospin doublets. They carry global 
quantum numbers including the baryon number, which is 1/3 for each quark, hypercharge and one of the flavor 
quantum numbers. 

Gluons are spin-1 bosons which also carry color charges, since they lie in the adjoint representation 8 of SU(3). They 
have no electric charge, do not participate in the weak interactions, and have no flavor. They lie in the singlet 
representation 1 of all these symmetry groups. 

Every quark has its own antiquark. The charge of each antiquark is exactly the opposite of the corresponding quark. 
Dynamics 

According to the rules of quantum field theory, and the associated Feynman diagrams, the above theory gives rise to 
three basic interactions: a quark may emit (or absorb) a gluon, a gluon may emit (or absorb) a gluon, and two gluons 
may directly interact. This contrasts with QED, in which only the first kind of interaction occurs, since photons have 
no charge. Diagrams involving Faddeev-Popov ghosts must be considered too. 

Area law and confinement 

Detailed computations with the above-mentioned Lagrangian^ 11 ^ show that the effective potential between a quark 

and its anti-quark in a meson contains a term OC r , which represents some kind of "stiffness" of the interaction 

between the particle and its anti-particle at large distances, similar to the entropic elasticity of a rubber band (see 

below). This leads to confinement of the quarks to the interiour of hadrons, i.e. mesons and nucleons, with typical 

ri3i 

radii R , corresponding to former "Bag models" of the hadrons . The order of magnitude of the "bag radius" is 1 
C -15 

fm (=10 m). Moreover, the above-mentioned stiffness is quantitatively related to the so-called "area law" 
behaviour of the expectation value of the Wilson loop product Pyyof the ordered coupling constants around a 
closed loop W; i.e. (Pw) * s proportional to the area enclosed by the loop. For this behaviour the non-abelian 
behaviour of the gauge group is essential. 

Methods 

Further analysis of the content of the theory is complicated. Various techniques have been developed to work with 
QCD. Some of them are discussed briefly below. 

Perturbative QCD 

This approach is based on asymptotic freedom, which allows perturbation theory to be used accurately in 
experiments performed at very high energies. Although limited in scope, this approach has resulted in the most 
precise tests of QCD to date. 

Lattice QCD 

Among non-perturbative approaches to QCD, the most well established one is lattice QCD. This approach uses a 
discrete set of space-time points (called the lattice) to reduce the analytically intractable path integrals of the 
continuum theory to a very difficult numerical computation which is then carried out on supercomputers like the 
QCDOC which was constructed for precisely this purpose. While it is a slow and resource-intensive approach, it has 
wide applicability, giving insight into parts of the theory inaccessible by other means. However, the numerical sign 
problem makes it difficult to use lattice methods to study QCD at high density and low temperature (e.g. nuclear 
matter or the interior of neutron stars). 
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1/N expansion 

A well-known approximation scheme, the 1/N expansion, starts from the premise that the number of colors is 
infinite, and makes a series of corrections to account for the fact that it is not. Until now it has been the source of 
qualitative insight rather than a method for quantitative predictions. Modern variants include the AdS/CFT approach. 

Effective theories 

For specific problems effective theories may be written down which give qualitatively correct results in certain 
limits. In the best of cases, these may then be obtained as systematic expansions in some parameter of the QCD 
Lagrangian. One such effective field theory is chiral perturbation theory or ChiPT, which is the QCD effective 
theory at low energies. More precisely, it is a low energy expansion based on the spontaneus chiral symmetry 
breaking of QCD, which is an exact symmetry when quark masses are equal to zero, but for the u,d and s quark, 
which have small mass, it is still a good approximate symmetry. Depending on the number of quarks which are 
treated as light, one uses either SU(2) ChiPT or SU(3) ChiPT . Other effective theories are heavy quark effective 
theory (which expands around heavy quark mass near infinity), and soft-collinear effective theory (which expands 
around large ratios of energy scales). In addition to effective theories, models like the Nambu-Jona-Lasinio model 
and the chiral model are often used when discussing general features. 

QCD Sum Rules 

Based on an Operator product expansion one can derive sets of relations that connect different observables with each 
other. 

Experimental tests 

The notion of quark flavours was prompted by the necessity of explaining the properties of hadrons during the 
development of the quark model. The notion of colour was necessitated by the puzzle of the A ++ . This has been dealt 
with in the section on the history of QCD. 

The first evidence for quarks as real constituent elements of hadrons was obtained in deep inelastic scattering 
experiments at SLAC. The first evidence for gluons came in three jet events at PETRA. 

Good quantitative tests of perturbative QCD are 

• the running of the QCD coupling as deduced from many observations 

• scaling violation in polarized and unpolarized deep inelastic scattering 

• vector boson production at colliders (this includes the Drell-Yan process) 

• jet cross sections in colliders 

• event shape observables at the LEP 

• heavy-quark production in colliders 

Quantitative tests of non-perturbative QCD are fewer, because the predictions are harder to make. The best is 
probably the running of the QCD coupling as probed through lattice computations of heavy-quarkonium spectra. 
There is a recent claim about the mass of the heavy meson B c [14]. Other non-perturbative tests are currently at the 
level of 5% at best. Continuing work on masses and form factors of hadrons and their weak matrix elements are 
promising candidates for future quantitative tests. The whole subject of quark matter and the quark-gluon plasma is a 
non-perturbative test bed for QCD which still remains to be properly exploited. 
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Cross-relations to Solid State Physics 

There are unexpected cross-relations to solid state physics. For example, the notion of gauge invariance forms the 
basis of the well-known Mattis spin glasses J 15 ^ which are systems with the usual spin degrees of freedom Si = ±1 
for /=1,...,N, with the special fixed "random" couplings J^ = Jo e^. Here the e. and e fc quantities can 
independently and "randomly" take the values ±1 , which corresponds to a most- simple gauge transformation 
(si — > Si • 6i Jik — > £iJi,k£k s k —> s k ' £k) - This means that thermodynamic expectation values of 
measurable quantities, e.g. of the energy *)-(,:= — ^ ^ s { J i k s fc , are invariant. 

However, here the coupling degrees of freedom Ji^ , which in the QCD correspond to the gluons, are "frozen" to 
fixed values (quenching). In contrast, in the QCD they "fluctuate" (annealing), and through the large number of 
gauge degrees of freedom the entropy plays an important role (see below). 

For positive Jo the thermodynamics of the Mattis spin glass corresponds in fact simply to a ferromagnet, just 
because these systems have no "frustration" at all. This term is a basic measure in spin glass theory J 16 ^ 
Quantitatively it is identical with the loop-product Pw ' — Ji,kJk,l- ■ ■ Jn.mJTn.i along a closed loop W. However, 

for a Mattis spin glass - in contrast to "genuine" spin glasses - the quantity never becomes negative. 
The basic notion "frustration" of the spin-glass is actually similar to the Wilson loop quantity of the QCD. The only 
difference is again that in the QCD one is dealing with SU(3) matrices, and that one is dealing with a "fluctuating" 
quantity. Energetically, perfect absence of frustration should be non-favorable and untypical for a spin glass, which 
means that one should add the loop-product to the Hamiltonian, by some kind of term representing a "punishment". - 
In the QCD the Wilson loop is essential for the Lagrangian rightaway. 

The relation between the QCD and "disordered magnetic systems" (the spin glasses belong to them) were 

ri7i 

additionally stressed in a paper by Fradkin, Huberman und Shenker, which also stresses the notion of duality. 

A further analogy consists in the already mentioned similarity to polymer physics, where, analogously to Wilson 

Loops, so-called "entangled nets" appear, which are important for the formation of the entropy-elasticity (force 

proportional to the length) of a rubber band. The non-abelian character of the SU(3) corresponds thereby to the 

non-trivial "chemical links", which glue different loop segments together, and "asymptotic freedom" means in the 

polymer analogy simply the fact that in the short-wave limit, i.e. for 0 <— X w <^ R c (where is a characteristic 

correlation-length for the glued loops, corresponding to the above-mentioned "bag radius", while X is the 

ri8i w 

wavelength of an excitation) any non-trivial correlation vanishes totally, as if the system had crystallized. 

There is also a correspondence between confinement in QCD - the fact that the colour-field is only different from 

zero in the interiour of hadrons - and the behaviour of the usual magnetic field in the theory of type-II 

ri9i 

superconductors: there the magnetism is confined to the interiour of the Abrikosov flux-line lattice, i.e., the 
London penetration depth A of that theory is analogous to the confinement radius of quantum chromodynamics. 
Mathematically, this correspondendence is supported by the second term, oc gG < j 1 'ipi'y fJ 'T?j'ifij ,on the r.h.s. of the 

Lagrangian. 
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External links 
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• The millennium prize (http://www.claymath.org/millennium/) for proving confinement (http://www. 
claymath. org/millennium/) 

• Ab Initio Determination of Light Hadron Masses (http://www.sciencemag.org/cgi/content/abstract/322/ 
5905/1224) 

• Andreas S Kronfeld (http://www.sciencemag.org/cgi/content/summary/322/5905/1198) The Weight of the 
World Is Quantum Chromodynamics 

• Andreas S Kronfeld (http://www.iop.Org/EJ/article/1742-6596/125/l/012067/jpconf8_125_012067. 
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• Quantum Chromodynamics (http ://arxiv. org/abs/hepph/950523 1 ) 

Quantum Geometry 

In theoretical physics, quantum geometry is the set of new mathematical concepts generalizing the concepts of 
geometry whose understanding is necessary to describe the physical phenomena at very short distance scales 
(comparable to Planck length). At these distances, quantum mechanics has a profound effect on physics. 

Each theory of quantum gravity uses the term quantum geometry in a slightly different fashion. String theory, a 
leading candidate for a quantum theory of gravity, uses the term quantum geometry to describe exotic phenomena 
such as T-duality and other geometric dualities, mirror symmetry, topology-changing transitions, minimal possible 
distance scale, and other effects that challenge our usual geometrical intuition. More technically, quantum geometry 
refers to the shape of the spacetime manifold as seen by D-branes which includes the quantum corrections to the 
metric tensor, such as the worldsheet instantons. For example, the quantum volume of a cycle is computed from the 
mass of a brane wrapped on this cycle. 

In an alternative approach to quantum gravity called loop quantum gravity (LQG), the phrase quantum geometry 
usually refers to the formalism within LQG where the observables that capture the information about the geometry 
are now well defined operators on a Hilbert space. In particular, certain physical observables, such as the area, have a 
discrete spectrum. It has also been shown that the loop quantum geometry is non-commutative. 

It is possible (but considered unlikely) that this strictly quantized understanding of geometry will be consistent with 
the quantum picture of geometry arising from string theory. 

Another, quite successful, approach, which tries to reconstruct the geometry of space-time from "first principles" is 
Discrete Lorentzian quantum gravity. 

External links 

• Space and Time: From Antiquity to Einstein and Beyond ^ 

• Quantum Geometry and its Applications 

• Hypercomplex Numbers in Geometry and Physics 
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Loop Quantum Gravity 

Loop quantum gravity (LQG), also known as loop gravity and quantum geometry, is a proposed quantum theory 
of spacetime which attempts to reconcile the theories of quantum mechanics and general relativity. Loop quantum 
gravity suggests that space can be viewed as an extremely fine fabric or network "woven" of finite quantised loops of 
excited gravitational fields called spin networks. When viewed over time, these spin networks are called spin foam, 
which should not be confused with quantum foam. A major quantum gravity contender with string theory, loop 
quantum gravity incorporates general relativity without requiring string theory's higher dimensions. 

LQG preserves many of the important features of general relativity, while simultaneously employing quantization of 
both space and time at the Planck scale in the tradition of quantum mechanics. The technique of loop quantization 
was developed for the nonperturbative quantization of diffeomorphism-invariant gauge theory. Roughly, LQG tries 
to establish a quantum theory of gravity in which the very space itself, where all other physical phenomena occur, 
becomes quantized. 

LQG is one of a family of theories called canonical quantum gravity. The LQG theory also includes matter and 
forces, but does not address the problem of the unification of all physical forces the way some other quantum gravity 
theories such as string theory do. 

History of LQG 

In 1986, Abhay Ashtekar reformulated Einstein's field equations of general relativity, using what have come to be 
known as Ashtekar variables, a particular flavor of Einstein-Cartan theory with a complex connection. In 1988, Carlo 
Rovelli and Lee Smolin used this formalism to introduce the loop representation of quantum general relativity, 
which was soon developed by Ashtekar, Rovelli, Smolin and many others. In the Ashtekar formulation, the 
fundamental objects are a rule for parallel transport (technically, a connection) and a coordinate frame (called a 
vierbein) at each point. Because the Ashtekar formulation was background-independent, it was possible to use 
Wilson loops as the basis for a nonperturbative quantization of gravity. Explicit (spatial) diffeomorphism invariance 
of the vacuum state plays an essential role in the regularization of the Wilson loop states. 

Around 1990, Rovelli and Smolin obtained an explicit basis of states of quantum geometry, which turned out to be 
labelled by Roger Penrose's spin networks, and showed that the geometry is quantized, that is, the 
(non-gauge-invariant) quantum operators representing area and volume have a discrete spectrum. In this context, 
spin networks arose as a generalization of Wilson loops necessary to deal with mutually intersecting loops. 
Mathematically, spin networks are related to group representation theory and can be used to construct knot invariants 
such as the Jones polynomial. 

Key concepts of loop quantum gravity 

In the framework of quantum field theory, and using the standard techniques of perturbative calculations, one finds 
that gravitation is non-renormalizable in contrast to the electroweak and strong interactions of the Standard Model of 
particle physics. This implies that there are infinitely many free parameters in the theory and thus that it cannot be 
predictive. 

In general relativity, the Einstein field equations assign a geometry (via a metric) to space-time. Before this, there is 
no physical notion of distance or time measurements. In this sense, general relativity is said to be background 
independent. An immediate conceptual issue that arises is that the usual framework of quantum mechanics, including 
quantum field theory, relies on a reference (background) space-time. Therefore, one approach to finding a quantum 
theory of gravity is to understand how to do quantum mechanics without relying on such a background; this is the 
approach of the canonical quantization/loop quantum gravity/spin foam approaches. 
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Starting with the initial-value-formulation of general 
relativity (cf. the section on General 
relativity#Evolution equations), the result is an 
analogue of the Schrodinger equation called the 
Wheeler-deWitt equation, which some argue is 
ill-defined A major break-through came with the 
introduction of what are now known as Ashtekar 
variables, which represent geometric gravity using 
mathematical analogues of electric and magnetic 
fields. The resulting candidate for a theory of 
quantum gravity is Loop quantum gravity, in which 
space is represented by a network structure called a 



spin network, evolving over time in discrete steps. 



[3] 




Simple spin network of the type used in loop quantum gravity 



Though not proven, it may be impossible to quantize 
gravity in 3+1 dimensions without creating matter and 

energy artifacts. Should LQG succeed as a quantum theory of gravity, the known matter fields will have to be 
incorporated into the theory a posteriori. Many of the approaches now being actively pursued (by Renate Loll, Jan 
Ambj0rn, Lee Smolin, Sundance Bilson-Thompson, Laurent Freidel, Mark B. Wise and others ^ ) combine matter 
with geometry. 

The main successes of loop quantum gravity are: 

1. It is a nonperturbative quantization of 3-space geometry, with quantized area and volume operators. 

2. It includes a calculation of the entropy of black holes. 

3. It replaces the Big Bang spacetime singularity with a Big Bounce. 

These claims are not universally accepted among the physics community, which is presently divided between 
different approaches to the problem of quantum gravity. LQG may possibly be viable as a refinement of either 
gravity or geometry. Many of the core results are rigorous mathematical physics; their physical interpretations 
remain speculative. Three speculative physical interpretations of LQG's core mathematical results are loop 
quantization, Lorentz invariance, General covariance and background independence, discussed below. Another 
physical test for LQG is to reproduce the physics of general relativity coupled with quantum field theory, discussed 
under problems. 



Loop quantization 

At the core of loop quantum gravity is a framework for nonperturbative quantization of diffeomorphism-invariant 
gauge theories, which one might call loop quantization. While originally developed in order to quantize vacuum 
general relativity in 3+1 dimensions, the formalism can accommodate arbitrary spacetime dimensionalities, 
fermions,^ an arbitrary gauge group (or even quantum group), and super symmetry/ 6 ^ and results in a quantization 
of the kinematics of the corresponding diffeomorphism-invariant gauge theory. Much work remains to be done on 
the dynamics, the classical limit and the correspondence principle, all of which are necessary in one way or another 
to make contact with experiment. 

In a nutshell, loop quantization is the result of applying C*-algebraic quantization to a non-canonical algebra of 
gauge-invariant classical observables. Non-canonical means that the basic observables quantized are not generalized 
coordinates and their conjugate momenta. Instead, the algebra generated by spin network observables (built from 
holonomies) and field strength fluxes is used. 

Loop quantization techniques are particularly successful in dealing with topological quantum field theories, where 
they give rise to state-sum/spin-foam models such as the Turaev-Viro model of 2+1 dimensional general relativity. A 
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much studied topological quantum field theory is the so-called BF theory in 3+1 dimensions. Since classical general 
relativity can be formulated as a BF theory with constraints, scientists hope that a consistent quantization of gravity 
may arise from the perturbation theory of BF spin-foam models. 

Lorentz in variance 

LQG is a quantization of a classical Lagrangian field theory which is equivalent to the usual Einstein-Cartan theory 
in that it leads to the same equations of motion describing general relativity with torsion. As such, it can be argued 
that LQG respects local Lorentz invariance. Global Lorentz invariance is broken in LQG just as in general relativity. 
A positive cosmological constant can be realized in LQG by replacing the Lorentz group with the corresponding 
quantum group. 

General covariance and background independence 

General covariance, also known as "diffeomorphism invariance", is the invariance of physical laws under arbitrary 

coordinate transformations. An example of this are the equations of general relativity, where this symmetry is one of 

the defining features of the theory. LQG preserves this symmetry by requiring that the physical states remain 

invariant under the generators of diffeomorphisms. The interpretation of this condition is well understood for purely 

spatial diffemorphisms. However, the understanding of diffeomorphisms involving time (the Hamiltonian constraint) 

T71 

is more subtle because it is related to dynamics and the so-called problem of time in general relativity. A generally 
accepted calculational framework to account for this constraint is yet to be found ^ 

Whether or not Lorentz invariance is broken in the low-energy limit of LQG, the theory is formally background 
independent. The equations of LQG are not embedded in, or presuppose, space and time, except for its invariant 
topology. Instead, they are expected to give rise to space and time at distances which are large compared to the 
Planck length. 



Problems 

While there has been a recent proposal relating to observation of naked singularities, tl0] and doubly special 
relativity, as a part of a program called loop quantum cosmology, as of now there is no experimental observation for 
which loop quantum gravity makes a prediction not made by the Standard Model or general relativity (a problem that 
plagues all current theories of quantum gravity). 

Making predictions from the theory of LQG has been extremely difficult computationally, also a recurring problem 
with modern theories in physics. 

Another problem is that a crucial free parameter in the theory known as the Immirzi parameter can only be computed 
by demanding agreement with Bekenstein and Hawking's calculation of the black hole entropy. Loop quantum 
gravity predicts that the entropy of a black hole is proportional to the area of the event horizon, but does not obtain 
the Bekenstein-Hawking formula S = A/4 unless the Immirzi parameter is chosen to give this value. A prediction 
directly from theory would be preferable. 

Presently, no semiclassical limit recovering general relativity has been shown to exist. This means it remains 
unproven that LQG's description of spacetime at the Planck scale has the right continuum limit, described by general 
relativity with possible quantum corrections. Specifically, the dynamics of the theory is encoded in the Hamiltonian 
constraint, but there is no candidate Hamiltonian (quantum mechanics). Other technical problems includes 
finding off-shell closure of the constraint algebra and physical inner product vector space, coupling to matter fields 
Quantum field theory, fate of the Renormalization of the graviton in Perturbation theory that lead to Ultraviolet 
divergence beyond 2-loops One-loop Feynman diagram in Feynman diagram. . The fate of Lorentz invariance in 
loop quantum gravity remains an open problem. ^ 13 ^ 
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Current LQG research directions attempt to address these known problems, and includes spinfoam models and 
entropic gravity tl5] . 

See also 

• Loop quantum cosmology 

• Heyting algebra 

• Category theory 

• Noncommutative geometry 

• Topos theory 

• C*-algebra 

• Regge calculus 

• Double special relativity 

• Lorentz in variance in loop quantum gravity 

• Immirzi parameter 

• Invariance mechanics 

• spin foam 

• group field theory 

• string-net 

• Kodama state 

• supersymmetry 

Notes 

[I] Cf. section 3 in Kuchaf 1973. 

[2] See Ashtekar 1986, Ashtekar 1987. 

[3] For a review, see Thiemann 2006; more extensive accounts can be found in Rovelli 1998, Ashtekar & Lewandowski 2004 as well as in the 

lecture notes Thiemann 2003. 
[4] See List of loop quantum gravity researchers 

[5] Baez, John, Krasnov,Kirill : Quantization of Diffeomorphism-Invariant Theories with Fermions arXiv:hep-th/97031 12 

[6] Ling,Yi; Smolin, Lee : Supersymmetric Spin Networks and Quantum Supergravity http://www.arxiv.org/abs/hep-th/9904016 

[7] See, e.g., Stuart Kauffman and Lee Smolin "A Possible Solution For The Problem Of Time In Quantum Cosmology" (1997). (http://www. 

edge . org/ 3rd_culture/ smolin/ smolin_p 1 . html) 
[8] See, e.g., Lee Smolin, "The Case for Background Independence", in Dean Rickles, et al. (eds.) The Structural Foundations of Quantum 

Gravity (2006), p 196 ff. 
[9] For a highly technical explanation, see, e.g., Carlo Rovelli (2004). Quantum Gravity, p 13 ff. 

[10] Goswami et al.. "Quantum evaporation of a naked singularity" (http://arxiv.org/abs/gr-qc/0506129). Physical Review Letters. . Retrieved 
2010-01-02. 

[II] http://arxiv.org/abs/hep-th/0501 1 14 
[12] http://arxiv.org/abs/hep-th/0501 1 14 

[13] "Testing Einstein's special relativity with Fermi's short hard gamma-ray burst GRB090510" (http://arxiv.org/abs/0908. 1832). Fermi 

GBM/LAT Collaborations. . Retrieved August 13, 2009. 
[ 1 4] http :// math. ucr . edu/home/baez/ week280 . html 

[15] Newtonian gravity in loop quantum gravity (http://arxiv.org/abs/1001.3668vl), Lee Smolin, 2010 
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Quantum Algebraic Topology 

In physics, topological order ^ is a new kind of order (a new kind of organization of particles) in a quantum state 
that is beyond the Landau symmetry-breaking description. It cannot be described by local order parameters and long 
range correlations. However, topological orders can be described by a new set of quantum numbers, such as ground 
state degeneracy, quasiparticle fractional statistics, edge states, topological entropy, etc. Roughly speaking, 
topological order is a pattern of long-range quantum entanglement in quantum states. States with different 
topological orders can change into each other only through a phase transition. 

Background 

Although all matter is formed by atoms, matter can have very different properties and appear in very different forms, 
such as solid, liquid, superfluid, magnet, etc. According to condensed matter physics and the principle of emergence, 
the different properties of materials originate from the different ways in which the atoms are organized in the 
materials. Those different organizations of the atoms (or other particles) are formally called the orders in the 
materials. 

Atoms can organize in many ways which lead to many different orders and many different types of materials. 
Landau symmetry-breaking theory provides a general understanding of these different orders. It points out that 
different orders really correspond to different symmetries in the organizations of the constituent atoms. As a material 
changes from one order to another order (i.e., as the material undergoes a phase transition), what happens is that the 
symmetry of the organization of the atoms changes. 

For example, atoms have a random distribution in a liquid, so a liquid remains the same as we displace it by an 
arbitrary distance. We say that a liquid has a continuous translation symmetry. After a phase transition, a liquid can 
turn into a crystal. In a crystal, atoms organize into a regular array (a lattice). A lattice remains unchanged only when 
we displace it by a particular distance (integer times of lattice constant), so a crystal has only discrete translation 
symmetry. The phase transition between a liquid and a crystal is a transition that reduces the continuous translation 
symmetry of the liquid to the discrete symmetry of the crystal. Such change in symmetry is called symmetry 
breaking. The essence of the difference between liquids and crystals is therefore that the organizations of atoms have 
different symmetries in the two phases. 

Landau symmetry-breaking theory is a very successful theory. For a long time, physicists believed that Landau 
symmetry-breaking theory describes all possible orders in materials, and all possible (continuous) phase transitions. 

The discovery and characterization of topological order 

However, since late 1980s, it has become gradually apparent that Landau symmetry-breaking theory may not 
describe all possible orders. In an attempt to explain high temperature superconductivity people introduced chiral 
spin state. ^ ^ At first, physicists still wanted to use Landau symmetry-breaking theory to describe the chiral spin 
state. They identified the chiral spin state as a state that breaks the time reversal and parity symmetries, but not the 
spin rotation symmetry. This should be the end of story according to Landau's symmetry breaking description of 
orders. However, it was quickly realized that there are many different chiral spin states that have exactly the same 



Quantum Algebraic Topology 



263 



symmetry, so symmetry alone was not enough to characterize different chiral spin states. This means that the chiral 

spin states contain a new kind of order that is beyond the usual symmetry description ^ . The proposed, new kind of 

order was named "topological order" ^ . (The name "topological order" is motivated by the low energy effective 

theory of the chiral spin states which is a topological quantum field theory (TQFT) ^ ^ ^ ). New quantum 

numbers, such as ground state degeneracy ^ and the non-Abelian Berry's phase of degenerate ground states . 

were introduced to characterize the different topological orders in chiral spin states. Recently, it was shown that 

ri2i r 1 3i 

topological orders can also be characterized by topological entropy . 

But experiments soon indicated that chiral spin states do not describe high-temperature superconductors, and the 
theory of topological order became a theory with no experimental realization. However, the similarity between chiral 
spin states and quantum Hall states allows one to use the theory of topological order to describe different quantum 
Hall states. ^ Just like chiral spin states, different quantum Hall states all have the same symmetry and are beyond 
the Landau symmetry-breaking description. One finds that the different orders in different quantum Hall states can 
indeed be described by topological orders, so the topological order does have experimental realizations. 

Fractional quantum Hall (FQH) states were discovered in 1982 ^ ^ before the introduction of the concept of 
topological order. But FQH states are not the first experiemntally discovered topologically ordered states. 
Superconductors discovered in 1911 are the first, which have a Z2 topological order (note, however, that 
superconductivity does fall within the Ginzburg-Landau theory description of phase transitions - in fact the 
prediction of the vortex state in superocnductors was one of the main successes of Ginzburg-Landau theory). tl7] tl8] 



Mechanism of topological order 

ri9i 

A large class of topological orders is realized through a mechanism called string-net condensation . This class of 
topological orders can be classified by utilizing tensor category (or monoidal category) theory. One finds that 
string-net condensation can generate infinitely many different types of topological orders, which may indicate that 
there are many different new types of materials remaining to be discovered. 

The collective motions of condensed strings give rise to excitations above the string-net condensed states. Those 
excitations turn out to be gauge bosons. The ends of strings are defects which correspond to another type of 
excitations. Those excitations are the gauge charges and can carry Fermi or fractional statistics. t20] 

The condensations of other extended objects such as "membranes", 1 "brane-nets" , L and fractals also lead to 
topologically ordered phases ^ and "quantum glassiness" ^ 



Mathematical foundation of topological order 

We know that group theory is the mathematical foundation of symmetry breaking orders. What is the mathematical 
foundation of topological order? The string-net condensation suggests that tensor category (or monoidal category) 
theory may be the mathematical foundation of topological order. Quantum operator algebra is a very important 
mathematical tool in studying topological orders. Some also suggest that topological order is mathematically 
described by extended quantum symmetry 



Applications 

The materials described by Landau symmetry-breaking theory have had a substantial impact on technology. For 
example, Ferromagnetic materials that break spin rotation symmetry can be used as the media of digital information 
storage. A hard drive made of ferromagnetic materials can store gigabytes of information. Liquid crystals that break 
the rotational symmetry of molecules find wide application in display technology; nowadays one can hardly find a 
household without a liquid crystal display somewhere in it. Crystals that break translation symmetry lead to well 
defined electronic bands which in turn allow us to make semiconducting devices such as transistors. 
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Different types of Topologically orders are even richer than different types of symmetry-breaking orders. This 
suggests their potential for exciting, novel applications. 

One theorized application would be to use topologically ordered states as media for quantum computing in a 
technique known as topological quantum computing. A topologically ordered state is a state with complicated 
non-local quantum entanglement. The non-locality means that the quantum entanglement in a topologically ordered 
state is distributed among many different particles. As a result, the pattern of quantum entanglements cannot be 
destroyed by local perturbations. This significantly reduces the effect of decoherence. This suggests that if we use 
different quantum entanglements in a topologically ordered state to encode quantum information, the information 
may last much longer. ^ The quantum information encoded by the topological quantum entanglements can also be 
manipulated by dragging the topological defects around each other. This process may provide a physical apparatus 
for performing quantum computations. Therefore, topologically ordered states may provide natural media for 
both quantum memory and quantum computation. Such realizations of quantum memory and quantum computation 

T281 

may potentially be made fault tolerant. 

Topologically ordered states in general have a special property that they contain non-trivial boundary states. In many 

cases, those boundary states become perfect conducting channel that can conduct electricity without generating 
T291 

heat. LZ - 7J This can be another potential application of topological order in electronic devices. For example, the 
topological insulator ^ ^ is a simple example of topological order. The boundary states of topological insulator 
play a key role in the detection and the application of topological insulators. 



Potential impact 

Why is topological order important? Landau symmetry-breaking theory is a cornerstone of condensed matter 

physics. It is used to define the territory of condensed matter research. The existence of topological order appears to 

indicate that nature is much richer than Landau symmetry-breaking theory has so far indicated. The exciting time of 

condensed matter physics is still ahead of us. Some suggest that topological order (or more precisely, string-net 

T321 

condensation) and the local bosonic (spin) models have the potential to provide a unified origin for photons, 

T331 

electrons and other elementary particles in our universe. 
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Commutativity 

In mathematics an operation is commutative if 

changing the order of the operands does not 
change the end result. It is a fundamental 
property of many binary operations, and many 
mathematical proofs depend on it. The 
commutativity of simple operations, such as 
multiplicaton and addition of numbers, was for 
many years implicitly assumed and the property 
was not named until the 19th century when 
mathematics started to become formalized. By 
contrast, division and subtraction are not 
commutative. 

Common uses 

The commutative property (or commutative law) is a property associated with binary operations and functions. 
Similarly, if the commutative property holds for a pair of elements under a certain binary operation then it is said that 
the two elements commute under that operation. 

In group and set theory, many algebraic structures are called commutative when certain operands satisfy the 
commutative property. In higher branches of mathematics, such as analysis and linear algebra the commutativity of 

well known operations (such as addition and multiplication on real and complex numbers) is often used (or implicitly 

A \ • -p [1] [2] [3] 
assumed) m proors. 

Mathematical definitions 

The term "commutative" is used in several related senses ^ 

1. A binary operation * on a set S is said to be commutative if: 

Vx, y£S:x*y = y*x 

- An operation that does not satisfy the above property is called noncommutative. 

2. One says that x commutes with y under * if: 

x * y = y * x 

3. A binary function f.AxA — > B is said to be commutative if: 

Vx,y G A : f(z,y) = f(y,x) 




Example showing the commutativity of addition (3 + 2 = 2 + 3) 
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History and etymology 



f ct f j sont telles qu'elles dcmnent 
el que soit Fordre dans lequel on les> 
ppelees commutatives entre elles* 

; aRz=JLaz ; 

The first known use of the term was in a French Journal published in 
1814 



Records of the implicit use of the commutative 
property go back to ancient times. The Egyptians used 
the commutative property of multiplication to simplify 
computing products ^ Euclid is known to have 
assumed the commutative property of multiplication in 

roi 

his book Elements. Formal uses of the commutative 
property arose in the late 18th and early 19th centuries, 
when mathematicians began to work on a theory of 
functions. Today the commutative property is a well 
known and basic property used in most branches of 
mathematics. 

The first recorded use of the term commutative was in a memoir by Francois Servois in 18 14,^ ^ which used the 
word commutatives when describing functions that have what is now called the commutative property. The word is a 
combination of the French word commuter meaning "to substitute or switch" and the suffix -ative meaning "tending 
to" so the word literally means "tending to substitute or switch." The term then appeared in English in Philosophical 



Transactions of the Royal Society in 1844 



[9] 



Related properties 



Associativity 

The associative property is closely related to the commutative 
property. The associative property of an expression containing two 
or more occurrences of the same operator states that the order in 
which operations are performed does not affect the final result, as 
long as the order of terms is not changed. In contrast, the 
commutative property states that the order of the terms does not 
affect the final result. 




Graph showing the symmetry of the addition function 



Symmetry can be directly linked to commutativity. When a 

commutative operator is written as a binary function then the resulting function is symmetric across the line y = x. 
As an example, if we let a function / represent addition (a commutative operation) so that f(x,y) = x + y then / is a 
symmetric function which can be seen in the image on the right. 

For binary relations, a symmetric relation is analogous to a commutative operation, in that if a relation R is 
symmetric, then a Rb <=> bRa • 
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Examples 

Commutative operations in everyday life 

• Putting on socks resembles a commutative operation, since which sock is put on first is unimportant. Either way, 
the end result (having both socks on), is the same. 

• The commutativity of addition is observed when paying for an item with cash. Regardless of the order in which 
the bills handed over, they always give the same total. 

Commutative operations in mathematics 

Two well-known examples of commutative binary operations are:^ 

• The addition of real numbers, which is commutative since 

V(y, z) £R:y + z = z + y 
For example 4 + 5 = 5 + 4, since both expressions equal 9. 

• The multiplication of real numbers, which is commutative since 

Vy, z G R : yz = zy 
For example, 3x5 = 5x3, since both expressions equal 15. 

• Further examples of commutative binary operations include addition and multiplication of complex numbers, 
addition and scalar multiplication of vectors, and intersection and union of sets. 

Noncommutative operations in everyday life 

• Concatenation, the act of joining character strings together, is a noncommutative operation. For example 

EA + T = EAT / TEA = T + EA 

• Washing and drying clothes resembles a noncommutative operation; washing and then drying produces a 
markedly different result to drying and then washing. 

• The twists of the Rubik's Cube are noncommutative. This is studied in group theory. 

Noncommutative operations in mathematics 

• Subtraction is noncommutative since 0 — 1^1 — 0 

• Division is noncommutative since 1/2 ^ 2/1 

• Infinite addition is not (necessarily) commutative: 

1-1 + 1-1 + 1-1 + 1-1 + .. .<1 

whereas 

1 + 1- 1 + 1 + 1 -1 + 1 + 1- l + ... = oc 

• Matrix multiplication is noncommutative since 
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• The vector product (or cross product) of two vectors in three dimensions is anti-commutative, i.e., b x a = (a x 
b). 

Some noncommutative binary operations are:^ 11] 
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Mathematical structures and commutativity 

• A commutative semigroup is a set endowed with a total, associative and commutative operation. 

• If the operation additionally has an identity element, we have a commutative monoid 

T21 

• An abelian group, or commutative group is a group whose group operation is commutative. 

• A commutative ring is a ring whose multiplication is commutative. (Addition in a ring is always 
commutative y 1 2 ^ 

ri3i 

• In a field both addition and multiplication are commutative. 



Non-commuting operators in quantum mechanics 

In quantum mechanics as formulated by Schrodinger, physical variables are represented by linear operators such as x 
(meaning multiply by x), and d/dx. These two operators do not commute as may be seen by considering the effect of 
their products x (d/dx) and (d/dx) x on a one-dimensional wave function ip(x): 

d d 

x—ij) = xip 7^ ——xip = ip + xip 
dx dx 

According to the uncertainty principle of Heisenberg, if the two operators representing a pair of variables do not 
commute, then that pair of variables are mutually complementary which means that they cannot be simultaneously 
measured or known precisely. For example, the position and the linear momentum of a particle are represented 
respectively (in the x-direction) by the operators x and (h/2jti)d/dx (where h is Planck's constant). This is the same 
example except for the constant (h/2jti), so again the operators do not commute and the physical meaning is that the 
position and linear momentum in a given direction are complementary. 



Notes 

[I] Axler,p.2 
[2] Gallian,p.34 
[3] p. 26,87 

[4] Krowne, p. 1 

[5] Weisstein, Commute, p.l 

[6] Lumpkin, p. 11 

[7] Gay and Shute, p.? 

[8] O'Conner and Robertson, Real Numbers 

[9] Cabillon and Miller, Commutative and Distributive 

[10] O'Conner and Robertson, Servois 

[II] Yark,p.l 
[12] Gallianp.236 
[13] Gallianp.250 
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Noncommutative quantum field theory 

In mathematical physics, noncommutative quantum field theory (or quantum field theory on noncommutative 
spacetime) is an application of noncommutative mathematics to the spacetime of quantum field theory that is an 
outgrowth of noncommutative geometry and index theory in which the coordinate functions 1 ^ are noncommutative. 
One commonly studied version of such theories has the "canonical" commutation relation: 

which means that (with any given set of axes), it is impossible to accurately measure the position of a particle with 
respect to more than one axis. In fact, this leads to an uncertainty relation for the coordinates analogous to the 
Heisenberg uncertainty principle. 

Various lower limits have been claimed for the noncommutative scale, (i.e. how accurately positions can be 
measured) but there is currently no experimental evidence in favour of such theory or grounds for ruling them out. 

One of the novel features of noncommutative field theories is the UV/IR mixing phenomenon in which the physics 
at high energies affects the physics at low energies which does not occur in quantum field theories in which the 
coordinates commute. 

Other features include violation of Lorentz in variance due to the preferred direction of noncommutativity. 
Relativistic in variance can however be retained in the sense of twisted Poincare in variance of the theory . The 
causality condition is modified from that of the commutative theories. 

History and motivation 

Heisenberg was the first to suggest extending noncommutativity to the coordinates as a possible way of removing the 
infinite quantities appearing in field theories before the renormalization procedure was developed and had gained 
acceptance. The first paper on the subject was published in 1947 by Hartland Snyder. The success of the 
renormalization method resulted in little attention being paid to the subject for some time. In the 1980s, 
mathematicians, most notably Alain Connes, developed noncommutative geometry. Among other things, this work 
generalized the notion of differential structure to a noncommutative setting. This led to an operator algebraic 
description of noncommutative space-times, and the development of a Yang-Mills theory on a noncommutative 
torus. 

The particle physics community became interested in the noncommutative approach because of a paper by Nathan 
Seiberg and Edward Witten.^ They argued in the context of string theory that the coordinate functions of the 
endpoints of open strings constrained to a D-brane in the presence of a constant Neveu- Schwartz B-field - 
equivalent to a constant magnetic field on the brane — would satisfy the noncommutative algebra set out above. The 
implication is that a quantum field theory on noncommutative spacetime can be interpreted as a low energy limit of 
the theory of open strings. 

A paper by Sergio Doplicher, Klaus Fredenhagen and John Roberts ^ set out another motivation for the possible 
noncommutativity of space-time. Their arguments goes as follows: According to general relativity, when the energy 
density grows sufficiently large, a black hole is formed. On the other hand according to the Heisenberg uncertainty 
principle, a measurement of a space-time separation causes an uncertainty in momentum inversely proportional to 
the extent of the separation. Thus energy whose scale corresponds to the uncertainty in momentum is localized in the 
system within a region corresponding to the uncertainty in position. When the separation is small enough, the 
Schwarzschild radius of the system is reached and a black hole is formed, which prevents any information from 
escaping the system. Thus there is a lower bound for the measurement of length. A sufficient condition for 
preventing gravitational collapse can be expressed as an uncertainty relation for the coordinates. This relation can in 
turn be derived from a commutation relation for the coordinates. 
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Footnotes 

[1] It is possible to have a noncommuting time coordinate, but this causes many problems such as the violation of unitarity of the S-matrix. 
Hence most research is restricted to so-called "space-space" noncommutativity. There have been attempts to avoid these problems by 
redefining the perturbation theory. However, string theory derivations of noncommutative coordinates excludes time-space noncommutativity. 

[2] See, for example, Shiraz Minwalla, Mark Van Raamsdonk, Nathan Seiberg (2000) " Noncommutative Perturbative Dynamics, (http://arxiv. 
org/abs/hep-th/99 12072)" Journal of High Energy Physics, and Alec Matusis, Leonard Susskind, Nicolaos Toumbas (2000) " The IR/UV 
Connection in the Non- Commutative Gauge Theories, (http://arxiv.org/abs/hep-th/0002075)" Journal of High Energy Physics. 

[3] M. Chaichian, P. Presnajder, A. Tureanu (2005) " New concept of relativistic invariance in NC space-time: twisted Poincare symmetry and its 
implications, (http://arxiv.org/abs/hep-th/0409096)" Phys. Rev. Letters 94: . 

[4] Seiberg, N. and E. Witten (1999) " String Theory and Noncommutative Geometry, (http://arxiv.org/abs/hep-th/9908142)" Journal of High 
Energy Physics . 

[5] Sergio Doplicher, Klaus Fredenhagen, John E. Roberts (1995) " The quantum structure of spacetime at the Planck scale and quantum fields, 
(http://arxiv.org/abs/hep-th/0303037)" Commun. Math. Phys. 172: 187-220. 

Further reading 

• M.R. Douglas and N. A. Nekrasov (2001) " Noncommutative field theory, (http://prola.aps.org/abstract/RMP/ 
v73/i4/p977_l?qid=a81527af6e5a2fa2&qseq=l&show=10) M Rev. Mod. Phys. 73: 977 - 1029. 

• Szabo, R. J. (2003) " Quantum Field Theory on Noncommutative Spaces, (http://arxiv.org/abs/hep-th/ 
0109162)" Physics Reports 378: 207-99. An expository article on noncommutative quantum field theories. 

• Noncommutative quantum field theory, see statistics (http://xstructure.inr.ac. ru/x-bin/theme3.py?level=2& 
indexl=-173391) on arxiv.org 

Noncommutative standard model 



In theoretical particle physics, the non-commutative Standard Model, mainly due to the French mathematician 
Alain Connes, uses his noncommutative geometry to devise an extension of the Standard Model to include a 
modified form of general relativity. This unification implies a few constraints on the parameters of the Standard 
Model. Under an additional assumption, known as the "big desert" hypothesis, one of these constraints determines 
the mass of the Higgs boson to be around 170 GeV, comfortably within the range of the Large Hadron Collider. 
Recent Tevatron experiments exclude a Higgs mass of 158 to 175 GeV at the 95% confidence level J 1 ^ 

Background 

Current physical theory features four elementary forces: the gravitational force, the electromagnetic force, the weak 
force, and the strong force. Gravity has an elegant and experimentally precise theory: Einstein's general relativity. It 
is based on Riemannian geometry and interprets the gravitational force as curvature of space-time. Its Lagrangian 
formulation requires only two empirical parameters, the gravitational constant and the cosmological constant. 

The other three forces also have a Lagrangian theory, called the Standard Model. Its underlying idea is that they are 
mediated by the exchange of spin-1 particles, the so-called gauge bosons. The one responsible for electromagnetism 
is the photon. The weak force is mediated by the W and Z bosons; the strong force, by gluons. The gauge Lagrangian 
is much more complicated than the gravitational one: at present, it involves some 30 real parameters, a number that 
could increase. What is more, the gauge Lagrangian must also contain a spin 0 particle, the Higgs boson, to give 
mass to the spin 1/2 and spin 1 particles. This particle has yet to be observed, and if it is not detected at the Large 
Hadron Collider in Geneva, the consistency of the Standard Model is in doubt. 

Alain Connes has generalized Bernhard Riemann's geometry to noncommutative geometry. It describes spaces with 
curvature and uncertainty. Historically, the first example of such a geometry is quantum mechanics, which 
introduced Heisenberg's uncertainty relation by turning the classical observables of position and momentum into 
noncommuting operators. Noncommutative geometry is still sufficiently similar to Riemannian geometry that 
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Connes was able to rederive general relativity. In doing so, he obtained the gauge Lagrangian as a companion of the 
gravitational one, a truly geometric unification of all four fundamental interactions. Connes has thus devised a fully 
geometric formulation of the Standard Model, where all the parameters are geometric invariants of a 
noncommutative space. A result is that parameters like the electron mass are now analogous to purely mathematical 
constants like pi. 

Notes 

[1] The TEVNPH Working Group (http://arxiv.org/abs/1007.4587) 
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Nonabelian Gauge Theory 

In physics, a gauge theory is a type of field theory in which the Lagrangian is invariant under a continuous group of 
local transformations. 

The term gauge refers to redundant degrees of freedom in the Lagrangian. The transformations between possible 
gauges, called gauge transformations, form a Lie group which is referred to as the symmetry group or the gauge 
group of the theory. Associated with any Lie group is the Lie algebra of group generators. For each group generator 
there necessarily arises a corresponding vector field called the gauge field. Gauge fields are included in the 
Lagrangian to ensure its in variance under the local group transformations (called gauge invariance). When such a 
theory is quantized, the quanta of the gauge fields are called gauge bosons. If the symmetry group is 
non-commutative, the gauge theory is referred to as non-abelian, the usual example being the Yang-Mills theory. 

Gauge theories are important as the successful field theories explaining the dynamics of elementary particles. 
Quantum electrodynamics is an abelian gauge theory with the symmetry group U(l) and has one gauge field, the 
electromagnetic field, with the photon being the gauge boson. The Standard Model is a non-abelian gauge theory 
with the symmetry group U(l)xSU(2)xSU(3) and has a total of twelve gauge bosons: the photon, three weak bosons 
and eight gluons. 

Many powerful theories in physics are described by Lagrangians which are invariant under some symmetry 
transformation groups. When they are invariant under a transformation identically performed at every point in the 
space in which the physical processes occur, they are said to have a global symmetry. The requirement of local 
symmetry, the cornerstone of gauge theories, is a stricter constraint. In fact, a global symmetry is just a local 
symmetry whose group's parameters are fixed in space-time. Gauge symmetries can be viewed as analogues of the 
equivalence principle of general relativity in which each point in spacetime is allowed a choice of local reference 
(coordinate) frame. Both symmetries reflect a redundancy in the description of a system. 

Historically, these ideas were first stated in the context of classical electromagnetism and later in general relativity. 
However, the modern importance of gauge symmetries appeared first in the relativistic quantum mechanics of 
electrons — quantum electrodynamics, elaborated on below. Today, gauge theories are useful in condensed matter, 
nuclear and high energy physics among other subfields. 

History and importance 

The earliest field theory having a gauge symmetry was Maxwell's formulation of electrodynamics in 1864. The 
importance of this symmetry remained unnoticed in the earliest formulations. Similarly unnoticed, Hilbert had 
derived the Einstein field equations by postulating the invariance of the action under a general coordinate 
transformation. Later Hermann Weyl, in an attempt to unify general relativity and electromagnetism, conjectured 
(incorrectly, as it turned out) that Eichinvarianz or invariance under the change of scale (or "gauge") might also be a 
local symmetry of general relativity. After the development of quantum mechanics, Weyl, Vladimir Fock and Fritz 
London modified gauge by replacing the scale factor with a complex quantity and turned the scale transformation 
into a change of phase — aU(l) gauge symmetry. This explained the electromagnetic field effect on the wave 
function of a charged quantum mechanical particle. This was the first widely recognised gauge theory, popularised 
by Pauli in the 1940s. [1] 

In 1954, attempting to resolve some of the great confusion in elementary particle physics, Chen Ning Yang and 
Robert Mills introduced non-abelian gauge theories as models to understand the strong interaction holding together 
nucleons in atomic nuclei. (Ronald Shaw, working under Abdus Salam, independently introduced the same notion in 
his doctoral thesis.) Generalizing the gauge invariance of electromagnetism, they attempted to construct a theory 
based on the action of the (non-abelian) SU(2) symmetry group on the isospin doublet of protons and neutrons. This 
is similar to the action of the U(l) group on the spinor fields of quantum electrodynamics. In particle physics the 
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emphasis was on using quantized gauge theories. 

This idea later found application in the quantum field theory of the weak force, and its unification with 
electromagnetism in the electroweak theory. Gauge theories became even more attractive when it was realized that 
non-abelian gauge theories reproduced a feature called asymptotic freedom. Asymptotic freedom was believed to be 
an important characteristic of strong interactions. This motivated searching for a strong force gauge theory. This 
theory, now known as quantum chromodynamics, is a gauge theory with the action of the SU(3) group on the color 
triplet of quarks. The Standard Model unifies the description of electromagnetism, weak interactions and strong 
interactions in the language of gauge theory. 

In the 1970s, Sir Michael Atiyah began studying the mathematics of solutions to the classical Yang-Mills equations. 
In 1983, Atiyah's student Simon Donaldson built on this work to show that the differentiable classification of smooth 
4-manifolds is very different from their classification up to homeomorphism. Michael Freedman used Donaldson's 
work to exhibit exotic R 4 s, that is, exotic differentiable structures on Euclidean 4-dimensional space. This led to an 
increasing interest in gauge theory for its own sake, independent of its successes in fundamental physics. In 1994, 
Edward Witten and Nathan Seiberg invented gauge-theoretic techniques based on supersymmetry which enabled the 
calculation of certain topological invariants. These contributions to mathematics from gauge theory have led to a 
renewed interest in this area. 

The importance of gauge theories for physics stems from the tremendous success of the mathematical formalism in 
providing a unified framework to describe the quantum field theories of electromagnetism, the weak force and the 
strong force. This theory, known as the Standard Model, accurately describes experimental predictions regarding 
three of the four fundamental forces of nature, and is a gauge theory with the gauge group SU(3) x SU(2) x U(l). 
Modern theories like string theory, as well as some formulations of general relativity, are, in one way or another, 
gauge theories. 

Hi 

See Pickering for more about the history of gauge and quantum field theories. 

Description 

Global and local symmetries 

In physics, the mathematical description of any physical situation usually contains excess degrees of freedom; the 
same physical situation is equally well described by many equivalent mathematical configurations. For instance, in 
Newtonian dynamics, if two configurations are related by a Galilean transformation — an inertial change of reference 
frame — they represent the same physical situation. These transformations form a group of "symmetries" of the 
theory, and a physical situation corresponds not to an individual mathematical configuration but to a class of 
configurations related to one another by this symmetry group. This idea can be generalized to include local as well as 
global symmetries, analogous to much more abstract "changes of coordinates" in a situation where there is no 
preferred "inertial" coordinate system that covers the entire physical system. A gauge theory is a mathematical 
model that has symmetries of this kind, together with a set of techniques for making physical predictions consistent 
with the symmetries of the model. 

Example of global symmetry 

When a quantity occurring in the mathematical configuration is not just a number but has some geometrical 
significance, such as a velocity or an axis of rotation, its representation as numbers arranged in a vector or matrix is 
also changed by a coordinate transformation. For instance, if one description of a pattern of fluid flow states that the 
fluid velocity in the neighborhood of (x=l, y=0) is 1 m/s in the positive x direction, then a description of the same 
situation in which the coordinate system has been rotated clockwise by 90 degrees will state that the fluid velocity in 
the neighborhood of (x=0, y=l) is 1 m/s in the positive y direction. The coordinate transformation has affected both 
the coordinate system used to identify the location of the measurement and the basis in which its value is expressed. 
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As long as this transformation is performed globally (affecting the coordinate basis in the same way at every point), 
the effect on values that represent the rate of change of some quantity along some path in space and time as it passes 
through point P is the same as the effect on values that are truly local to P. 

Use of fiber bundles to describe local symmetries 

In order to adequately describe physical situations in more complex theories, it is often necessary to introduce a 
"coordinate basis" for some of the objects of the theory that do not have this simple relationship to the coordinates 
used to label points in space and time. (In mathematical terms, the theory involves a fiber bundle in which the fiber 
at each point of the base space consists of possible coordinate bases for use when describing the values of objects at 
that point.) In order to spell out a mathematical configuration, one must choose a particular coordinate basis at each 
point (a local section of the fiber bundle) and express the values of the objects of the theory (usually "fields" in the 
physicist's sense) using this basis. Two such mathematical configurations are equivalent (describe the same physical 
situation) if they are related by a transformation of this abstract coordinate basis (a change of local section, or gauge 
transformation) . 

In most gauge theories, the set of possible transformations of the abstract gauge basis at an individual point in space 
and time is a finite-dimensional Lie group. The simplest such group is U(l), which appears in the modern 
formulation of quantum electrodynamics (QED) via its use of complex numbers. QED is generally regarded as the 
first, and simplest, physical gauge theory. The set of possible gauge transformations of the entire configuration of a 
given gauge theory also forms a group, the gauge group of the theory. An element of the gauge group can be 
parameterized by a smoothly varying function from the points of spacetime to the (finite-dimensional) Lie group, 
whose value at each point represents the action of the gauge transformation on the fiber over that point. 

A gauge transformation with constant parameter at every point in space and time is analogous to a rigid rotation of 
the geometric coordinate system; it represents a global symmetry of the gauge representation. As in the case of a 
rigid rotation, this gauge transformation affects expressions that represent the rate of change along a path of some 
gauge-dependent quantity in the same way as those that represent a truly local quantity. A gauge transformation 
whose parameter is not a constant function is referred to as a local symmetry; its effect on expressions that involve a 
derivative is qualitatively different from that on expressions that don't. (This is analogous to a non-inertial change of 
reference frame, which can produce a Coriolis effect.) 

Gauge fields 

The "gauge covariant" version of a gauge theory accounts for this effect by introducing a gauge field (in 
mathematical language, an Ehresmann connection) and formulating all rates of change in terms of the covariant 
derivative with respect to this connection. The gauge field becomes an essential part of the description of a 
mathematical configuration. A configuration in which the gauge field can be eliminated by a gauge transformation 
has the property that its field strength (in mathematical language, its curvature) is zero everywhere; a gauge theory is 
not limited to these configurations. In other words, the distinguishing characteristic of a gauge theory is that the 
gauge field does not merely compensate for a poor choice of coordinate system; there is generally no gauge 
transformation that makes the gauge field vanish. 

When analyzing the dynamics of a gauge theory, the gauge field must be treated as a dynamical variable, similarly to 
other objects in the description of a physical situation. In addition to its interaction with other objects via the 
covariant derivative, the gauge field typically contributes energy in the form of a "self-energy" term. One can obtain 
the equations for the gauge theory by: 

• starting from a naive ansatz without the gauge field (in which the derivatives appear in a "bare" form); 

• listing those global symmetries of the theory that can be characterized by a continuous parameter (generally an 
abstract equivalent of a rotation angle); 

• computing the correction terms that result from allowing the symmetry parameter to vary from place to place; and 
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• reinterpreting these correction terms as couplings to one or more gauge fields, and giving these fields appropriate 
self-energy terms and dynamical behavior. 

This is the sense in which a gauge theory "extends" a global symmetry to a local symmetry, and closely resembles 
the historical development of the gauge theory of gravity known as general relativity. 

Physical experiments 

Gauge theories are used to model the results of physical experiments, essentially by: 

• limiting the universe of possible configurations to those consistent with the information used to set up the 
experiment, and then 

• computing the probability distribution of the possible outcomes that the experiment is designed to measure. 

The mathematical descriptions of the "setup information" and the "possible measurement outcomes" (loosely 
speaking, the "boundary conditions" of the experiment) are generally not expressible without reference to a particular 
coordinate system, including a choice of gauge. (If nothing else, one assumes that the experiment has been 
adequately isolated from "external" influence, which is itself a gauge-dependent statement.) Mishandling gauge 
dependence in boundary conditions is a frequent source of anomalies in gauge theory calculations, and gauge 
theories can be broadly classified by their approaches to anomaly avoidance. 

Continuum theories 

The two gauge theories mentioned above (continuum electrodynamics and general relativity) are examples of 
continuum field theories. The techniques of calculation in a continuum theory implicitly assume that: 

• given a completely fixed choice of gauge, the boundary conditions of an individual configuration can in principle 
be completely described; 

• given a completely fixed gauge and a complete set of boundary conditions, the principle of least action determines 
a unique mathematical configuration (and therefore a unique physical situation) consistent with these bounds; 

• the likelihood of possible measurement outcomes can be determined by: 

• establishing a probability distribution over all physical situations determined by boundary conditions that are 
consistent with the setup information, 

• establishing a probability distribution of measurement outcomes for each possible physical situation, and 

• convolving these two probability distributions to get a distribution of possible measurement outcomes 
consistent with the setup information; and 

• fixing the gauge introduces no anomalies in the calculation, due either to gauge dependence in describing partial 
information about boundary conditions or to incompleteness of the theory. 

These assumptions are close enough to valid, across a wide range of energy scales and experimental conditions, to 
allow these theories to make accurate predictions about almost all of the phenomena encountered in daily life, from 
light, heat, and electricity to eclipses and spaceflight. They fail only at the smallest and largest scales (due to 
omissions in the theories themselves) and when the mathematical techniques themselves break down (most notably 
in the case of turbulence and other chaotic phenomena). 

Quantum field theories 

Other than these "classical" continuum field theories, the most widely known gauge theories are quantum field 
theories, including quantum electrodynamics and the Standard Model of elementary particle physics. The starting 
point of a quantum field theory is much like that of its continuum analog: a gauge-co variant action integral which 
characterizes "allowable" physical situations according to the principle of least action. However, continuum and 
quantum theories differ significantly in how they handle the excess degrees of freedom represented by gauge 
transformations. Continuum theories, and most pedagogical treatments of the simplest quantum field theories, use a 
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gauge fixing prescription to reduce the orbit of mathematical configurations that represent a given physical situation 
to a smaller orbit related by a smaller gauge group (the global symmetry group, or perhaps even the trivial group). 

More sophisticated quantum field theories, in particular those which involve a non-abelian gauge group, break the 
gauge symmetry within the techniques of perturbation theory by introducing additional fields (the Faddeev-Popov 
ghosts) and counterterms motivated by anomaly cancellation, in an approach known as BRST quantization. While 
these concerns are in one sense highly technical, they are also closely related to the nature of measurement, the limits 
on knowledge of a physical situation, and the interactions between incompletely specified experimental conditions 
and incompletely understood physical theory. The mathematical techniques that have been developed in order to 
make gauge theories tractable have found many other applications, from solid-state physics and crystallography to 
low-dimensional topology. 

Classical gauge theory 
Classical electromagnetism 

Historically, the first example of gauge symmetry to be discovered was classical electromagnetism. In static 
electricity, one can either discuss the electric field, E, or its corresponding electric potential, V. Knowledge of one 
makes it possible to find the other, except that potentials differing by a constant, V — > V + C , correspond to the 
same electric field. This is because the electric field relates to changes in the potential from one point in space to 
another, and the constant C would cancel out when subtracting to find the change in potential. In terms of vector 
calculus, the electric field is the gradient of the potential, E = — WV- Generalizing from static electricity to 
electromagnetism, we have a second potential, the vector potential A, with 



where / is any function that depends on position and time. The fields remain the same under the gauge 
transformation, and therefore Maxwell's equations are still satisfied. That is, Maxwell's equations have a gauge 
symmetry. 

An example: Scalar 0(#i) gauge theory 

The remainder of this section requires some familiarity with classical or quantum field theory, and the use of 



Definitions in this section: gauge group, gauge field, interaction Lagrangian, gauge boson. 

The following illustrates how local gauge invariance can be "motivated" heuristically starting from global symmetry 
properties, and how it leads to an interaction between fields which were originally non-interacting. 

Consider a set of n non-interacting scalar fields, with equal masses m. This system is described by an action which is 
the sum of the (usual) action for each scalar field ip% 



B = Vx A . 

The general gauge transformations now become not just V — > V + C but 

A + V/ 





Lagrangians. 




The Lagrangian (density) can be compactly written as 
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by introducing a vector of fields 

The term is Einstein notation for the partial derivative of $in each of the four dimensions. It is now transparent 
that the Lagrangian is invariant under the transformation 
$ h-> <J>' = 

whenever G is a constant matrix belonging to the n-by-n orthogonal group 0(n). This is seen to preserve the 
Lagrangian since the derivative of $will transform identically to $and both quantities appear inside dot products 
in the Lagrangian (orthogonal transformations preserve the dot product). 

This characterizes the global symmetry of this particular Lagrangian, and the symmetry group is often called the 
gauge group; the mathematical term is structure group, especially in the theory of G-structures. Incidentally, 
Noether's theorem implies that in variance under this group of transformations leads to the conservation of the 
current 

J* = id^ T T a ® 

where the matrices are generators of the SO(^z) group. There is one conserved current for every generator. 

Now, demanding that this Lagrangian should have local 0(n) -in variance requires that the G matrices (which were 
earlier constant) should be allowed to become functions of the space-time coordinates x. 

Unfortunately, the G matrices do not "pass through" the derivatives, when G = G(x), 

The failure of the derivative to commute with "G" introduces an additional term (in keeping with the product rule) 
which spoils the invariance of the Lagrangian. In order to rectify this we define a new derivative operator such that 
the derivative of $will again transform identically with $ 

(Dp®)' = GDfl. 

This new "derivative" is called a covariant derivative and takes the form 

Dp = dp + gAp 

Where g is called the coupling constant - a quantity defining the strength of an interaction. After a simple 
calculation we can see that the gauge field A(x) must transform as follows 

A'p = GApG -1 — \d ft G)G~ 1 

The gauge field is an element of the Lie algebra, and can therefore be expanded as 

a 

There are therefore as many gauge fields as there are generators of the Lie algebra. 
Finally, we now have a locally gauge invariant Lagrangian 

Aoc = \(D^) T D^ - l -m 2 ® T ®. 

Pauli calls gauge transformation of the first type to the one applied to fields as while the compensating 
transformation in A is sa id to be a gauge transformation of the second type. 
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The difference between this Lagrangian and the original globally 
gauge-invariant Lagrangian is seen to be the interaction 
Lagrangian 




Feynman diagram of scalar bosons interacting via a 
gauge boson 



Ant = !$ T A^>$ + |(d„$) T A"$ + ^-(A^fA^. 

This term introduces interactions between the n scalar fields just as a consequence of the demand for local gauge 
invariance. However, to make this interaction physical and not completely arbitrary, the mediator A(x) needs to 
propagate in space. That is dealt with in the next section by adding yet another term, , to the Lagrangian. In the 

quantized version of the obtained classical field theory, the quanta of the gauge field A(x) are called gauge bosons. 
The interpretation of the interaction Lagrangian in quantum field theory is of scalar bosons interacting by the 
exchange of these gauge bosons. 

The Yang-Mills Lagrangian for the gauge field 

The picture of a classical gauge theory developed in the previous section is almost complete, except for the fact that 
to define the covariant derivatives D, one needs to know the value of the gauge field A(x)at all space-time points. 
Instead of manually specifying the values of this field, it can be given as the solution to a field equation. Further 
requiring that the Lagrangian which generates this field equation is locally gauge invariant as well, one possible form 
for the gauge field Lagrangian is (conventionally) written as 

£ gf = -^TV(F^) 

with 

F fU/ = ^[D fl ,D v ] 
W 

and the trace being taken over the vector space of the fields. This is called the Yang-Mills action. Other gauge 
invariant actions also exist (e.g. nonlinear electrodynamics, Born-Infeld action, Chern-Simons model, theta term 
etc.). 

Note that in this Lagrangian term there is no field whose transformation counterweighs the one of A • Invariance of 
this term under gauge transformations is a particular case of a priori classical (geometrical) symmetry. This 
symmetry must be restricted in order to perform quantization, the procedure being denominated gauge fixing, but 
even after restriction, gauge transformations may be possible. 1 

The complete Lagrangian for the gauge theory is now 

— ^loc H" ^gf — ^global ~T" ^int ~~T" ^gf 
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An example: Electrodynamics 

As a simple application of the formalism developed in the previous sections, consider the case of electrodynamics, 
with only the electron field. The bare-bones action which generates the electron field's Dirac equation is 

The global symmetry for this system is 
1p h-> e i0 ljj. 

The gauge group here is U(l), just the phase angle of the field, with a constant 6. 
"Localising this symmetry implies the replacement of 9 by 9(x). 
An appropriate covariant derivative is then 

Dp = dp - i—Ap. 

Identifying the "charge" e with the usual electric charge (this is the origin of the usage of the term in gauge theories), 
and the gauge field A(x) with the four- vector potential of electromagnetic field results in an interaction Lagrangian 

Ant = ^(l)r^l)^(l) = J»{x)Ap( X ). 

where J^(x)is the usual four vector electric current density. The gauge principle is therefore seen to naturally 
introduce the so-called minimal coupling of the electromagnetic field to the electron field. 

Adding a Lagrangian for the gauge field ^4 /x (x)in terms of the field strength tensor exactly as in electrodynamics, 
one obtains the Lagrangian which is used as the starting point in quantum electrodynamics. 

£ QED = ^{ihc-fD,. - mc 2 )ip - -^-F^F^. 

See also: Dirac equation, Maxwell's equations, Quantum electrodynamics 

Mathematical formalism 

Gauge theories are usually discussed in the language of differential geometry. Mathematically, a gauge is just a 
choice of a (local) section of some principal bundle. A gauge transformation is just a transformation between two 
such sections. 

Although gauge theory is dominated by the study of connections (primarily because it's mainly studied by 
high-energy physicists), the idea of a connection is not central to gauge theory in general. In fact, a result in general 
gauge theory shows that affine representations (i.e. affine modules) of the gauge transformations can be classified as 
sections of a jet bundle satisfying certain properties. There are representations which transform covariantly pointwise 
(called by physicists gauge transformations of the first kind), representations which transform as a connection form 
(called by physicists gauge transformations of the second kind, an affine representation) and other more general 
representations, such as the B field in BF theory. There are more general nonlinear representations (realizations), but 
are extremely complicated. Still, nonlinear sigma models transform nonlinearly, so there are applications. 

If there is a principal bundle P whose base space is space or spacetime and structure group is a Lie group, then the 
sections of P form a principal homogeneous space of the group of gauge transformations. 

connection (gauge connection) define this principal bundle, yielding a covariant derivative V in each associated 
vector bundle. If a local frame is chosen (a local basis of sections), then this covariant derivative is represented by 
the connection form A, a Lie algebra- valued 1-form which is called the gauge potential in physics. This is evidently 
not an intrinsic but a frame-dependent quantity. The curvature form F is constructed from a connection form, a Lie 
algebra- valued 2-form which is an intrinsic quantity, by 

F = d A + A A A 
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where d stands for the exterior derivative and A stands for the wedge product. ( A is an element of the vector space 
spanned by the generators y a '» an d so the components of A do not commute with one another. Hence the wedge 
product A A A does not vanish.) 

Infinitesimal gauge transformations form a Lie algebra, which is characterized by a smooth Lie algebra valued 
scalar, 8. Under such an infinitesimal gauge transformation, 



Also, 6 £ F = eF, which means Ftransforms covariantly. 

Not all gauge transformations can be generated by infinitesimal gauge transformations in general. An example is 
when the base manifold is a compact manifold without boundary such that the homotopy class of mappings from that 
manifold to the Lie group is nontrivial. See instanton for an example. 

The Yang— Mills action is now given by 



where * stands for the Hodge dual and the integral is defined as in differential geometry. 

A quantity which is gauge-invariant i.e. invariant under gauge transformations is the Wilson loop, which is defined 
over any closed path, y, as follows: 



where % is the character of a complex representation p and *p represents the path-ordered operator. 

Quantization of gauge theories 

Gauge theories may be quantized by specialization of methods which are applicable to any quantum field theory. 
However, because of the subtleties imposed by the gauge constraints (see section on Mathematical formalism, 
above) there are many technical problems to be solved which do not arise in other field theories. At the same time, 
the richer structure of gauge theories allow simplification of some computations: for example Ward identities 
connect different renormalization constants. 

Methods and aims 

The first gauge theory to be quantized was quantum electrodynamics (QED). The first methods developed for this 
involved gauge fixing and then applying canonical quantization. The Gupta-Bleuler method was also developed to 
handle this problem. Non-abelian gauge theories are now handled by a variety of means. Methods for quantization 
are covered in the article on quantization. 

The main point to quantization is to be able to compute quantum amplitudes for various processes allowed by the 
theory. Technically, they reduce to the computations of certain correlation functions in the vacuum state. This 
involves a renormalization of the theory. 

When the running coupling of the theory is small enough, then all required quantities may be computed in 
perturbation theory. Quantization schemes intended to simplify such computations (such as canonical quantization) 
may be called perturbative quantization schemes. At present some of these methods lead to the most precise 
experimental tests of gauge theories. 

However, in most gauge theories, there are many interesting questions which are non-perturbative. Quantization 
schemes suited to these problems (such as lattice gauge theory) may be called non-perturbative quantization 



S £ A = [e, A] — de 

where -]is the Lie bracket. 



One nice thing is that if 6 £ X = eX , then 8 £ DX = sDX where D is the co variant derivative 

DX = dX + AX. 
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schemes. Precise computations in such schemes often require supercomputing, and are therefore less well-developed 
currently than other schemes. 

Anomalies 

Some of the symmetries of the classical theory are then seen not to hold in the quantum theory — a phenomenon 
called an anomaly. Among the most well known are: 

• The scale anomaly, which gives rise to a running coupling constant. In QED this gives rise to the phenomenon of 
the Landau pole. In Quantum Chromodynamics (QCD) this leads to asymptotic freedom. 

• The chiral anomaly in either chiral or vector field theories with fermions. This has close connection with topology 
through the notion of instantons. In QCD this anomaly causes the decay of a pion to two photons. 

• The gauge anomaly, which must cancel in any consistent physical theory. In the electroweak theory this 
cancellation requires an equal number of quarks and leptons. 

Pure gauge 

A pure gauge is the set of field configurations obtained by a gauge transformation on the null field configuration. So 
it is a particular "gauge orbit" in the field configuration's space. 

In the abelian case, where A^[x) — v A f ^{x) = A^{po) + d^f{x) , the pure gauge is the set of field 
configurations A'^x) = d fl f(x) for all f(x). 
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External links 

• Yang-Mills equations on Dispersive Wiki 

rm 

• Gauge theories on Scholarpedia 
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List of quantum field theories 

List of quantum field theories: 

• Chern-Simons model 

• Chiral model 

• Gross-Neveu 

• Kondo model 

• Lower dimensional quantum field theory 

• Minimal model 

• Nambu-Jona-Lasinio 

• Noncommutative quantum field theory 

• Nonlinear sigma model 

• Phi to the fourth 

• Quantum chromodynamics 

• Quantum electrodynamics 

• Quantum flavordynamics 

• Quantum Yang-Mills theory 

• Schwinger model 

• Sine-Gordon 

• Standard model 

• String Theory 

• Thirring model 

• Toda field theory 

• Topological quantum field theory 

• Wess-Zumino model 

• Wess-Zumino-Witten model 

• Yang-Mills 

• Yang-Mills-Higgs model 

• Yukawa model 
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Noncommutative geometry 

Noncommutative geometry (NCG), is a branch of mathematics concerned with geometric approach to 
noncommutative algebras, and with construction of spaces which are locally presented by noncommutative algebras 
of functions (possibly in some generalized sense). A noncommutative algebra is here an associative algebra in which 
the multiplication is not commutative, that is, for which xy does not always equal yx\ or more generally an algebraic 
structure in which one of the principal binary operations is not commutative; one also allows additional structures, 
e.g. topology or norm to be possibly carried by the noncommutative algebra of functions. The leading direction in 
noncommutative geometry has been laid by French mathematician Alain Connes since his involvement from about 
1979. 

Motivation 

Main motivation is to extend the commutative duality between spaces and functions to the noncommutative setting. 
In mathematics, there is a close relationship between spaces, which are geometric in nature, and the numerical 
functions on them. In general, such functions will form a commutative ring. For instance, one may take the ring C(X) 
of continuous complex- valued functions on a topological space X. In many important cases (e.g., if X is a compact 
Hausdorff space), we can recover X from C(X), and therefore it makes some sense to say that X has commutative 
geometry. 

More specifically, in topology, compact Hausdorff topological spaces can be reconstructed from the Banach algebra 
of functions on the space (Gel'fand-Neimark). In commutative algebraic geometry, algebraic schemes are locally 
prime spectra of commutative unital rings (A. Grothendieck), and schemes can be reconstructed from the categories 
of quasicoherent sheaves of modules on them (P. Gabriel-A. Rosenberg). For Grothendieck topologies, the 
cohomological properties of a site are invariant of the corresponding category of sheaves of sets viewed abstractly as 
a topos (A. Grothendieck). In all these cases, a space is reconstructed from the algebra of functions or its categorified 
version — some category of sheaves on that space. 

Functions on a topological space can be multiplied and added pointwise hence they form a commutative algebra; in 
fact these operations are local in the topology of the base space, hence the functions form a sheaf of commutative 
rings over the base space. 

The dream of noncommutative geometry is to generalize this duality to the duality between 

• noncommutative algebras, or sheaves of noncommutative algebras, or sheaf-like noncommutative algebraic or 
operator-algebraic structures 

• and geometric entities of certain kind, 

and interact between the algebraic and geometric description of those via this duality. 

Regarding that the commutative rings correspond to usual affine schemes, and commutative C*-algebras to usual 
topological spaces, the extension to noncommutative rings and algebras requires non-trivial generalization of 
topological spaces, as "non-commutative spaces". For this reason, some talk about non-commutative topology, 
though the term has also other meanings. 
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Applications in mathematical physics 

Some applications in particle physics are described on the entries Noncommutative standard model and 
Noncommutative quantum field theory. Sudden rise in interest in noncommutative geometry in physics, follows after 
the speculations of its role in M-theory made in 1997^ . 

Motivation from ergodic theory 

Some of the theory developed by Alain Connes to handle noncommutative geometry at a technical level has roots in 
older attempts, in particular in ergodic theory. The proposal of George Mackey to create a virtual subgroup theory, 
with respect to which ergodic group actions would become homogeneous spaces of an extended kind, has by now 
been subsumed. 

Non-commutative C* -algebras, von Neumann algebras 

(The formal duals of) non-commutative C*-algebras are often now called non-commutative spaces. This is by 
analogy with the Gelfand representation, which shows that commutative C* -algebras are dual to locally compact 
Hausdorff spaces. In general, one can associate to any C*-algebra S a topological space S; see spectrum of a 
C*-algebra. 

For the duality between a-finite measure spaces and commutative von Neumann algebras, noncommutative von 
Neumann algebras are called non-commutative measure spaces. 

Non-commutative differentiable manifolds 

A smooth Riemannian manifold M is a topological space with a lot of extra structure. From its algebra of continuous 
functions C(M) we only recover M topologically. The algebraic invariant that recovers the Riemannian structure is a 
spectral triple. It is constructed from a smooth vector bundle E over M, e.g. the exterior algebra bundle. The Hilbert 
space L2(M,E) of square integrable sections of E carries a representation of C(M) by multiplication operators, and we 
consider an unbounded operator D in L^M^) with compact resolvent (e.g. the signature operator), such that the 
commutators [D,f] are bounded whenever / is smooth. A recent deep theorem states that M as a Riemannian 
manifold can be recovered from this data. 

This suggests that one might define a noncommutative Riemannian manifold as a spectral triple (A,H,D), consisting 
of a representation of a C*-algebra A on a Hilbert space H, together with an unbounded operator D on H, with 
compact resolvent, such that [D,a] is bounded for all a in some dense subalgebra of A. Research in spectral triples is 
very active, and many examples of noncommutative manifolds have been constructed. 

Non-commutative affine and projective schemes 

In analogy to the duality between affine schemes and commutative rings, we define a category of noncommutative 
affine schemes as the dual of the category of associative unital rings. There are certain analogues of Zariski topology 
in that context so that one can glue such affine schemes to more general objects. 

There are also generalizations of the Cone and of the Proj of a commutative graded ring, mimicking a Serre's 
theorem on Proj. Namely the category of quasicoherent sheaves of O-modules on a Proj of a commutative graded 
algebra is equivalent to the category of graded modules over the ring localized on Serre's subcategory of graded 
modules of finite length; there is also analogous theorem for coherent sheaves when the algebra is Noetherian. This 
theorem is extended as a definition of noncommutative projective geometry by Michael Artin and J. J. Zhang , 
who add also some general ring-theoretic conditions (e.g. Artin- Schelter regularity). 

Many properties of projective schemes extend to this context. For example, there exist an analog of the celebrated 
Serre duality for noncommutative projective schemes of Artin and Zhang . 
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A. L. Rosenberg has created a rather general relative concept of noncommutative quasicompact scheme (over a 

base category), abstracting the Grothendieck's study of morphisms of schemes and covers in terms of categories of 

Mi 

quasicoherent sheaves and flat localization functors . There is also another interesting approach via localization 
theory, due to Fred Van Oystaeyen, Luc Willaert and Alain Verschoeren, where the main concept is that of a 
schematic algebra 1 ^ . 

Invariants for noncommutative spaces 

Some of the motivating questions of the theory are concerned with extending known topological invariants to formal 
duals of noncommutative (operator) algebras and other replacements and candidates for noncommutative spaces. 
One of the main starting points of the Alain Connes' direction in noncommutative geometry is his spectacular 
discovery (and independently by Boris Tsygan) of a very important new homology theory associated to 
noncommutative associative algebras and noncommutative operator algebras, namely the cyclic homology and its 
relations to the algebraic K-theory (primarily via Connes-Chern character map). 

The theory of characteristic classes of smooth manifolds has been extended to spectral triples, employing the tools of 
operator K-theory and cyclic cohomology. Several generalizations of now classical index theorems allow for 
effective extraction of numerical invariants from spectral triples. The fundamental characteristic class in cyclic 
cohomology, the JLO cocycle, generalizes the classical Chern character. 

Examples of non-commutative spaces 

• In Weyl quantization, the symplectic phase space of classical mechanics is deformed into a non-commutative 
phase space generated by the position and momentum operators. 

• The standard model of particle physics is another example of a noncommutative geometry, cf noncommutative 
standard model. 

• The noncommutative torus, deformation of the function algebra of the ordinary torus, can be given the structure 
of a spectral triple. This class of examples has been studied intensively and still functions as a test case for more 
complicated situations. 

• Snyder space ^ 

• Noncommutative algebras arising from foliations. 

• Examples related to dynamical systems arising from number theory, such as the Gauss shift on continued 
fractions, give rise to noncommutative algebras that appear to have interesting noncommutative geometries. 
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Quantum gravity 

Quantum gravity (QG) is the field of theoretical physics attempting to unify quantum mechanics with general 
relativity in a self-consistent manner, or more precisely, to formulate a self-consistent theory which reduces to 

2 

ordinary quantum mechanics in the limit of weak gravity (potentials much less than c ) and which reduces to 
Einsteinian general relativity in the limit of large actions (action much larger than reduced Planck's constant). The 
theory must be able to predict the outcome of situations where both quantum effects and strong-field gravity are 
important (at the Planck scale, unless large extra dimension conjectures are correct). Motivation for quantizing 
gravity comes from the remarkable success of the quantum theories of the other three fundamental interactions. 
Although some quantum gravity theories such as string theory and other so-called theories of everything attempt to 
unify gravity with the other fundamental forces, others such as loop quantum gravity make no such attempt; they 
simply quantize the gravitational field while keeping it separate from the other forces. 

Observed physical phenomena in the early 21st century can be described well by quantum mechanics or general 
relativity, without needing both. This can be thought of as due to an extreme separation of mass scales at which they 
are important. Quantum effects are usually important only for the "very small", that is, for objects no larger than 
typical molecules. General relativistic effects, on the other hand, show up only for the "very large" bodies such as 
collapsed stars. (Planets' gravitational fields, as of 2009, are well-described by linearized gravity; so strong-field 

2 

effects — any effects of gravity beyond lowest nonvanishing order in cp/c — have not been observed even in the 

gravitational fields of planets and main sequence stars). There is a lack of experimental evidence relating to quantum 

gravity and classical physics adequately describes the observed effects of gravity over a range of 50 orders of 

-23 30 

magnitude of mass, i.e. for masses of objects from about 10 to 10 kg. 

Overview 

Much of the difficulty in meshing these theories at all energy scales comes from the different assumptions that these 
theories make on how the universe works. Quantum field theory depends on particle fields embedded in the flat 
space-time of special relativity. General relativity models gravity as a curvature within space-time that changes as a 
gravitational mass moves. Historically, the most obvious way of combining the two (such as treating gravity as 
simply another particle field) ran quickly into what is known as the renormalization problem. In the old-fashioned 
understanding of renormalization, gravity particles would attract each other and adding together all of the 
interactions results in many infinite values which cannot easily be cancelled out mathematically to yield sensible, 
finite results. This is in contrast with quantum electrodynamics where, while the series still do not converge, the 
interactions sometimes evaluate to infinite results, but those are few enough in number to be removable via 
renormalization. 

Effective field theories 

Quantum gravity can be treated as an effective field theory. Effective quantum field theories come with some 
high-energy cutoff, beyond which we do not expect that the theory provides a good description of nature. The 
"infinities" then become large but finite quantities proportional to this finite cutoff scale, and correspond to processes 
that involve very high energies near the fundamental cutoff. These quantities can then be absorbed into an infinite 
collection of coupling constants, and at energies well below the fundamental cutoff of the theory, to any desired 
precision; only a finite number of these coupling constants need to be measured in order to make legitimate 
quantum-mechanical predictions. This same logic works just as well for the highly successful theory of low-energy 
pions as for quantum gravity. Indeed, the first quantum-mechanical corrections to graviton- scattering and Newton's 
law of gravitation have been explicitly computed^ (although they are so astronomically small that we may never be 
able to measure them). In fact, gravity is in many ways a much better quantum field theory than the Standard Model, 
since it appears to be valid all the way up to its cutoff at the Planck scale. (By comparison, the Standard Model is 
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expected to start to break down above its cutoff at the much smaller scale of around 1000 GeV.) 

While confirming that quantum mechanics and gravity are indeed consistent at reasonable energies, it is clear that 
near or above the fundamental cutoff of our effective quantum theory of gravity (the cutoff is generally assumed to 
be of order the Planck scale), a new model of nature will be needed. Specifically, the problem of combining quantum 
mechanics and gravity becomes an issue only at very high energies, and may well require a totally new kind of 
model. 

Quantum gravity theory for the highest energy scales 

The general approach to deriving a quantum gravity theory that is valid at even the highest energy scales is to 
assume that such a theory will be simple and elegant and, accordingly, to study symmetries and other clues offered 
by current theories that might suggest ways to combine them into a comprehensive, unified theory. One problem 
with this approach is that it is unknown whether quantum gravity will actually conform to a simple and elegant 
theory, as it should resolve the dual conundrums of special relativity with regard to the uniformity of acceleration 
and gravity, and general relativity with regard to spacetime curvature. 

Such a theory is required in order to understand problems involving the combination of very high energy and very 
small dimensions of space, such as the behavior of black holes, and the origin of the universe. 

Quantum mechanics and general relativity 



The graviton 

At present, one of the deepest problems in theoretical physics is harmonizing 
the theory of general relativity, which describes gravitation, and applies to 
large-scale structures (stars, planets, galaxies), with quantum mechanics, 
which describes the other three fundamental forces acting on the atomic scale. 
This problem must be put in the proper context, however. In particular, 
contrary to the popular claim that quantum mechanics and general relativity 
are fundamentally incompatible, one can demonstrate that the structure of 
general relativity essentially follows inevitably from the quantum mechanics 
of interacting theoretical spin-2 massless particles ^ ^ ^ ^ ^ (called 
gravitons). 

While there is no concrete proof of the existence of gravitons, quantized 
theories of matter may necessitate their existence. Supporting this theory is 
the observation that all other fundamental forces have one or more messenger 
particles, except gravity, leading researchers to believe that at least one most 
likely does exist; they have dubbed these hypothetical particles gravitons. 
Many of the accepted notions of a unified theory of physics since the 1970s, 
including string theory, superstring theory, M-theory, loop quantum gravity, 
all assume, and to some degree depend upon, the existence of the graviton. 
Many researchers view the detection of the graviton as vital to validating their 
work. 




Gravity Probe B (GP-B) has measured 
spacetime curvature near Earth to test 

related models in application of 
Einstein's general theory of relativity. 



Quantum gravity 



293 



The dilaton 

The dilaton made its first appearance in Kaluza-Klein theory, a five-dimensional theory that combined gravitation 
and electromagnetism. Generally, it appears in string theory. More recently, it has appeared in the lower-dimensional 
many-bodied gravity problem 1 based on the field theoretic approach of Roman Jackiw. The impetus arose from the 
fact that complete analytical solutions for the metric of a covariant Af-body system have proven elusive in General 
Relativity. To simplify the problem, the number of dimensions was lowered to (1+1) namely one spatial dimension 

roi 

and one temporal dimension. This model problem, known as R=T theory (as opposed to the general G=T theory) 
was amenable to exact solutions in terms of a generalization of the Lambert W function. It was also found that the 
field equation governing the dilaton (derived from differential geometry) was none other than the Schrodinger 

rm 

equation and consequently amenable to quantization. 1 Thus, one had a theory which combined gravity, quantization 
and even the electromagnetic interaction, promising ingredients of a fundamental physical theory. It is worth noting 
that the outcome revealed a previously unknown and already existing natural link between general relativity and 
quantum mechanics. However, this theory needs to be generalized in (2+1) or (3+1 ) dimensions although, in 
principle, the field equations are amenable to such generalization. It is not yet clear what field equation will govern 
the dilaton in higher dimensions. This is further complicated by the fact that gravitons can propagate in (3+1) 
dimensions and consequently that would imply gravitons and dilatons exist in the real world. Moreover, detection of 
the dilaton is expected to be even more elusive than the graviton. However, since this approach allows for the 
combination of gravitational, electromagnetic and quantum effects, their coupling could potentially lead to a means 
of vindicating the theory, through cosmology and perhaps even experimentally. 

Nonrenormalizability of gravity 

General relativity, like electromagnetism, is a classical field theory. One might expect that, as with 
electromagnetism, there should be a corresponding quantum field theory. 

However, gravity is nonrenormalizableJ 10 ^ For a quantum field theory to be well-defined according to this 
understanding of the subject, it must be asymptotically free or asymptotically safe. The theory must be characterized 
by a choice of finitely many parameters, which could, in principle, be set by experiment. For example, in quantum 
electrodynamics, these parameters are the charge and mass of the electron, as measured at a particular energy scale. 

On the other hand, in quantizing gravity, there are infinitely many independent parameters needed to define the 
theory. For a given choice of those parameters, one could make sense of the theory, but since we can never do 
infinitely many experiments to fix the values of every parameter, we do not have a meaningful physical theory: 

• At low energies, the logic of the renormalization group tells us that, despite the unknown choices of these 
infinitely many parameters, quantum gravity will reduce to the usual Einstein theory of general relativity. 

• On the other hand, if we could probe very high energies where quantum effects take over, then every one of the 
infinitely many unknown parameters would begin to matter, and we could make no predictions at all. 

As explained below, there is a way around this problem by treating QG as an effective field theory. 

Any meaningful theory of quantum gravity that makes sense and is predictive at all energy scales must have some 
deep principle that reduces the infinitely many unknown parameters to a finite number that can then be measured. 

• One possibility is that normal perturbation theory is not a reliable guide to the renormalizability of the theory, and 
that there really is a UV fixed point for gravity. Since this is a question of non-perturbative quantum field theory, 
it is difficult to find a reliable answer, but some people still pursue this option. 

• Another possibility is that there are new symmetry principles that constrain the parameters and reduce them to a 
finite set. This is the route taken by string theory, where all of the excitations of the string essentially manifest 
themselves as new symmetries. 
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QG as an effective field theory 

In an effective field theory, all but the first few of the infinite set of parameters in a non-renormalizable theory are 
suppressed by huge energy scales and hence can be neglected when computing low-energy effects. Thus, at least in 
the low-energy regime, the model is indeed a predictive quantum field theory J 1 ^ (A very similar situation occurs for 
the very similar effective field theory of low-energy pions.) Furthermore, many theorists agree that even the 
Standard Model should really be regarded as an effective field theory as well, with "nonrenormalizable" interactions 
suppressed by large energy scales and whose effects have consequently not been observed experimentally. 

Recent work^ has shown that by treating general relativity as an effective field theory, one can actually make 
legitimate predictions for quantum gravity, at least for low-energy phenomena. An example is the well-known 
calculation of the tiny first-order quantum-mechanical correction to the classical Newtonian gravitational potential 
between two masses. 

Spacetime background dependence 

A fundamental lesson of general relativity is that there is no fixed spacetime background, as found in Newtonian 
mechanics and special relativity; the spacetime geometry is dynamic. While easy to grasp in principle, this is the 
hardest idea to understand about general relativity, and its consequences are profound and not fully explored, even at 
the classical level. To a certain extent, general relativity can be seen to be a relational theory,^ 1 ^ in which the only 
physically relevant information is the relationship between different events in space-time. 

On the other hand, quantum mechanics has depended since its inception on a fixed background (non-dynamic) 
structure. In the case of quantum mechanics, it is time that is given and not dynamic, just as in Newtonian classical 
mechanics. In relativistic quantum field theory, just as in classical field theory, Minkowski spacetime is the fixed 
background of the theory. 

String theory 

String theory started out as a generalization of quantum field 
theory where instead of point particles, string-like objects 
propagate in a fixed spacetime background. Although string theory 
had its origins in the study of quark confinement and not of 
quantum gravity, it was soon discovered that the string spectrum 
contains the graviton, and that "condensation" of certain vibration 
modes of strings is equivalent to a modification of the original 
background. In this sense, string perturbation theory exhibits 
exactly the features one would expect of a perturbation theory that 
may exhibit a strong dependence on asymptotics (as seen, for 
example, in the AdS/CFT correspondence) which is a weak form 
of background dependence. 

Background independent theories 

Loop quantum gravity is the fruit of an effort to formulate a background-independent quantum theory. 

Topological quantum field theory provided an example of background-independent quantum theory, but with no 
local degrees of freedom, and only finitely many degrees of freedom globally. This is inadequate to describe gravity 
in 3+1 dimensions which has local degrees of freedom according to general relativity. In 2+1 dimensions, however, 
gravity is a topological field theory, and it has been successfully quantized in several different ways, including spin 
networks. 




Interaction in the subatomic world: world lines of 
point-like particles in the Standard Model or a world 
sheet swept up by closed strings in string theory 
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Semi-classical quantum gravity 

Quantum field theory on curved (non-Minkowskian) backgrounds, while not a full quantum theory of gravity, has 
shown many promising early results. In an analogous way to the development of quantum electrodynamics in the 
early part of the 20th century (when physicists considered quantum mechanics in classical electromagnetic fields), 
the consideration of quantum field theory on a curved background has led to predictions such as black hole radiation. 

Phenomena such as the Unruh effect, in which particles exist in certain accelerating frames but not in stationary 
ones, do not pose any difficulty when considered on a curved background (the Unruh effect occurs even in flat 
Minkowskian backgrounds). The vacuum state is the state with least energy (and may or may not contain particles). 
See Quantum field theory in curved spacetime for a more complete discussion. 

Points of tension 

There are other points of tension between quantum mechanics and general relativity. 

• First, classical general relativity breaks down at singularities, and quantum mechanics becomes inconsistent with 
general relativity in the neighborhood of singularities (however, no one is certain that classical general relativity 
applies near singularities in the first place). 

• Second, it is not clear how to determine the gravitational field of a particle, since under the Heisenberg 

uncertainty principle of quantum mechanics its location and velocity cannot be known with certainty. The 

ri2i 

resolution of these points may come from a better understanding of general relativity. 

• Third, there is the Problem of Time in quantum gravity. Time has a different meaning in quantum mechanics and 
general relativity and hence there are subtle issues to resolve when trying to formulate a theory which combines 
the two. [13] 



Candidate theories 

There are a number of proposed quantum gravity theories J 14 ^ Currently, there is still no complete and consistent 
quantum theory of gravity, and the candidate models still need to overcome major formal and conceptual problems. 
They also face the common problem that, as yet, there is no way to put quantum gravity predictions to experimental 
tests, although there is hope for this to change as future data from cosmological observations and particle physics 
experiments becomes available J- ^ 
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String theory 




One suggested starting point is ordinary quantum field 

theories which, after all, are successful in describing 

the other three basic fundamental forces in the context 

of the standard model of elementary particle physics. 

However, while this leads to an acceptable effective 

ri7i 

(quantum) field theory of gravity at low energies, 

gravity turns out to be much more problematic at 

higher energies. Where, for ordinary field theories such 

as quantum electrodynamics, a technique known as 

renormalization is an integral part of deriving 

predictions which take into account higher-energy 
n 8i 

contributions, gravity turns out to be 

nonrenormalizable: at high energies, applying the 
recipes of ordinary quantum field theory yields models 
that are devoid of all predictive powerJ 19 ^ 

One attempt to overcome these limitations is to replace 

ordinary quantum field theory, which is based on the 

classical concept of a point particle, with a quantum 

theory of one-dimensional extended objects: string theory At the energies reached in current experiments, these 

strings are indistinguishable from point-like particles, but, crucially, different modes of oscillation of one and the 

same type of fundamental string appear as particles with different (electric and other) charges. In this way, string 

theory promises to be a unified description of all particles and interactions. The theory is successful in that one 

mode will always correspond to a graviton, the messenger particle of gravity; however, the price to pay are unusual 

[221 

features such as six extra dimensions of space in addition to the usual three for space and one for time. 

In what is called the second superstring revolution, it was conjectured that both string theory and a unification of 
general relativity and supersymmetry known as supergravity form part of a hypothesized eleven-dimensional 
model known as M-theory, which would constitute a uniquely defined and consistent theory of quantum gravity J 24 ^ 
^ As presently understood, however, string theory is consistent with an estimated 10 500 equally possible solutions 
(the so-called "string landscape"). 



Projection of a Calabi-Yau manifold, one of the ways of 
compactifying the extra dimensions posited by string theory 
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Loop quantum gravity 

Another approach to quantum gravity starts with the 

canonical quantization procedures of quantum theory. 

Starting with the initial-value-formulation of general 

relativity (cf. the section on evolution equations, 

above), the result is an analogue of the Schrodinger 

equation: the Wheeler-DeWitt equation, which some 

argue is ill-defined P^ A major break-through came 

with the introduction of what are now known as 

Ashtekar variables, which represent geometric gravity 

using mathematical analogues of electric and magnetic 

fields P 1 ^ ^ The resulting candidate for a theory of 

quantum gravity is Loop quantum gravity, in which 

space is represented by a network structure called a 

T291 

spin network, evolving over time in discrete steps. 
[30] [31] [32] 




Simple spin network of the type used in loop quantum gravity 



Other approaches 

There are a number of other approaches to quantum gravity. The approaches differ depending on which features of 
general relativity and quantum theory are accepted unchanged, and which features are modified P 3 ^ t34] Examples 
include: 

Supergravity 

[351 

Path-integral based models of quantum cosmology 
Causal Dynamical Triangulation^ 36 ^ 
Regge calculus 
Causal sets'" 37 -' 
Asymptotic safety 
Twistor models ^ 38 ^ 
Noncommutative geometry. 

[39] 

String-nets giving rise to gapless helicity ±2 excitations with no other gapless excitations 
Acoustic metric and other analog models of gravity 
MacDowell-Mansouri action 
Group field theory 14 ^ 



Weinberg-Witten theorem 

In quantum field theory, the Weinberg-Witten theorem places some constraints on theories of composite 
gravity/emergent gravity. However, recent developments attempt to show that if locality is only approximate and the 
holographic principle is correct, the Weiberg-Witten theorem would not be valid. 
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String Theories 

String theory is a developing theory in particle physics that attempts to reconcile quantum mechanics and general 
relativity It is a contender for the theory of everything (TOE), a manner of describing the known fundamental 
forces and matter in a mathematically complete system. The theory has yet to make testable experimental 
predictions, which a theory must do in order to be considered a part of science. 

String theory mainly posits that the electrons and quarks within an atom are not 0-dimensional objects, but rather 
1-dimensional oscillating lines ("strings"). The earliest string model, the bosonic string, incorporated only bosons, 
although this view developed to the superstring theory, which posits that a connection (a "super symmetry") exists 
between bosons and fermions. String theories also require the existence of several extra, unobservable, dimensions to 
the universe, in addition to the usual four spacetime dimensions. 

The theory has its origins in the dual resonance model (1969). Since that time, the term string theory has developed 
to incorporate any of a group of related superstring theories. Five major string theories were formulated. The main 
differences between each of them were the number of dimensions in which the strings developed and their 
characteristics, however all appeared to be correct. In the mid 1990s a unification of all previous superstring theories, 
called M-theory, was proposed, which asserted that strings are really 1-dimensional slices of a 2-dimensional 
membrane vibrating in 1 1-dimensional space. 

As a result of the many properties and principles shared by these approaches (such as the holographic principle), 
their mutual logical consistency, and the fact that some easily include the standard model of particle physics, some 
mathematical physicists (e.g. Witten, Maldacena and Susskind) believe that string theory is a step towards the correct 
fundamental description of nature ^ ^ ^ Nevertheless, other prominent physicists (e.g. Feynman and Glashow) 
have criticized string theory for not providing any quantitative experimental predictions. ^ ^ 



Overview 

String theory posits that the electrons and quarks within an atom are not 0-dimensional objects, but 1-dimensional 
strings. These strings can move and vibrate, giving the observed particles their flavor, charge, mass and spin. String 
theories also include objects more general than strings, called branes. The word brane, derived from "membrane", 
refers to a variety of interrelated objects, such as D-branes, black p-branes and Neveu-Schwarz 5-branes. These are 
extended objects that are charged sources for differential form generalizations of the vector potential electromagnetic 
field. These objects are related to one another by a variety of dualities. Black hole-like black p-branes are identified 
with D-branes, which are endpoints for strings, and this identification is called Gauge-gravity duality. Research on 
this equivalence has led to new insights on quantum chromodynamics, the fundamental theory of the strong nuclear 
force ^ ^ The strings make closed loops unless they encounter D-branes, where they can open up into 
1-dimensional lines. The endpoints of the string cannot break off the D-brane, but they can slide around on it. 
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Since the string theory is widely believed to be a consistent theory of 
quantum gravity, many hope that it correctly describes our universe, 
making it a theory of everything. There are known configurations which 
describe all the observed fundamental forces and matter but with a zero 
cosmological constant and some new fields. There are other 
configurations with different values of the cosmological constant, which 
are metastable but long-lived. This leads many to believe that there is at 
least one metastable solution which is quantitatively identical with the 
standard model, with a small cosmological constant, which contains 
dark matter and a plausible mechanism for cosmic inflation. It is not yet 
known whether string theory has such a solution, nor how much 
freedom the theory allows to choose the details. 

The full theory does not yet have a satisfactory definition in all 
circumstances, since the scattering of strings is most straightforwardly 
defined by a perturbation theory. The complete quantum mechanics of 
high dimensional branes is not easily defined, and the behavior of string 
theory in cosmological settings (time-dependent backgrounds) is not 
fully worked out. It is also not clear if there is any principle by which 
string theory selects its vacuum state, the spacetime configuration which 
determines the properties of our universe (see string theory landscape). 

Like any other quantum theory of gravity, it is widely believed that 
testing the theory directly would require prohibitively expensive feats of 
engineering. Although direct experimental testing of string theory 
involves grand explorations and development in engineering, there are 
several indirect experiments that may prove partial truth to string theory. 




Levels of magnification: 
1 . Macroscopic level - Matter 
2. Molecular level 
3. Atomic level - Protons, neutrons, and 
electrons 
4. Subatomic level — Electron 
5. Subatomic level - Quarks 
6. String level 



Basic properties 

String theory can be formulated in terms of an action principle, either the Nambu-Goto action or the Polyakov 
action, which describes how strings move through space and time. In the absence of external interactions, string 
dynamics are governed by tension and kinetic energy, which combine to produce oscillations. The quantum 
mechanics of strings implies these oscillations take on discrete vibrational modes, the spectrum of the theory. 

On distance scales larger than the string radius, each oscillation mode behaves as a different species of particle, with 
its mass, spin and charge determined by the string's dynamics. Splitting and recombination of strings correspond to 
particle emission and absorption, giving rise to the interactions between particles. 

An analogy for strings' modes of vibration is a guitar string's production of multiple but distinct musical notes. In the 
analogy, different notes correspond to different particles. The only difference is the guitar is only 2-dimensional; you 
can strum it up, and down. In actuality the guitar strings would be every dimension, and the strings could vibrate in 
any direction, meaning that the particles could move through not only our dimension, but other dimensions as well. 

String theory includes both open strings, which have two distinct endpoints, and closed strings making a complete 
loop. The two types of string behave in slightly different ways, yielding two different spectra. For example, in most 
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string theories, one of the closed string modes is the graviton, and one of the open string modes is the photon. 
Because the two ends of an open string can always meet and connect, forming a closed string, there are no string 
theories without closed strings. 

The earliest string model, the bosonic string, incorporated only bosons. This model describes, in low enough 
energies, a quantum gravity theory, which also includes (if open strings are incorporated as well) gauge fields such 
as the photon (or, more generally, any gauge theory). However, this model has problems. Most importantly, the 
theory has a fundamental instability, believed to result in the decay (at least partially) of spacetime itself. 
Additionally, as the name implies, the spectrum of particles contains only bosons, particles which, like the photon, 
obey particular rules of behavior. Roughly speaking, bosons are the constituents of radiation, but not of matter, 
which is made of fermions. Investigating how a string theory may include fermions in its spectrum led to the 
invention of super symmetry, a mathematical relation between bosons and fermions. String theories which include 
fermionic vibrations are now known as superstring theories; several different kinds have been described, but all are 
now thought to be different limits of M- theory. 

Some qualitative properties of quantum strings can be understood in a fairly simple fashion. For example, quantum 
strings have tension, much like regular strings made of twine; this tension is considered a fundamental parameter of 
the theory. The tension of a quantum string is closely related to its size. Consider a closed loop of string, left to move 
through space without external forces. Its tension will tend to contract it into a smaller and smaller loop. Classical 
intuition suggests that it might shrink to a single point, but this would violate Heisenberg's uncertainty principle. The 
characteristic size of the string loop will be a balance between the tension force, acting to make it small, and the 
uncertainty effect, which keeps it "stretched". Consequently, the minimum size of a string is related to the string 
tension. 

World-sheet 

A point-like particle's motion may be described by drawing a graph of its position (in one or two dimensions of 
space) against time. The resulting picture depicts the worldline of the particle (its 'history') in spacetime. By analogy, 
a similar graph depicting the progress of a string as time passes by can be obtained; the string (a one-dimensional 
object — a small line — by itself) will trace out a surface (a two-dimensional manifold), known as the worldsheet. 
The different string modes (representing different particles, such as photon or graviton) are surface waves on this 
manifold. 

A closed string looks like a small loop, so its worldsheet will look like a pipe or, more generally, a Riemann surface 
(a two-dimensional oriented manifold) with no boundaries (i.e. no edge). An open string looks like a short line, so its 
worldsheet will look like a strip or, more generally, a Riemann surface with a boundary. 
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Strings can split and connect. This is 

reflected by the form of their worldsheet 

(more accurately, by its topology). For 

example, if a closed string splits, its 

worldsheet will look like a single pipe 

splitting (or connected) to two pipes (often 

referred to as a pair of pants — see drawing 

at right). If a closed string splits and its two 

parts later reconnect, its worldsheet will 

look like a single pipe splitting to two and 

then reconnecting, which also looks like a 

torus connected to two pipes (one 

representing the ingoing string, and the 

, , a \ a • Standard Model or a world sheet swept up by closed strings in string theory 

other — the outgoing one). An open string 

doing the same thing will have its 

worldsheet looking like a ring connected to two strips. 

Note that the process of a string splitting (or strings connecting) is a global process of the worldsheet, not a local 
one: locally, the worldsheet looks the same everywhere and it is not possible to determine a single point on the 
worldsheet where the splitting occurs. Therefore these processes are an integral part of the theory, and are described 
by the same dynamics that controls the string modes. 

In some string theories (namely, closed strings in Type I and some versions of the bosonic string), strings can split 
and reconnect in an opposite orientation (as in a Mobius strip or a Klein bottle). These theories are called unoriented. 
Formally, the worldsheet in these theories is a non-orientable surface. 

Dualities 

Before the 1990s, string theorists believed there were five distinct superstring theories: open type I, closed type I, 
closed type IIA, closed type IIB, and the two flavors of heterotic string theory (SO(32) and ExEJ. 1 The thinking 

o o 

was that out of these five candidate theories, only one was the actual correct theory of everything, and that theory 
was the one whose low energy limit, with ten spacetime dimensions compactified down to four, matched the physics 
observed in our world today. It is now believed that this picture was incorrect and that the five superstring theories 
are connected to one another as if they are each a special case of some more fundamental theory (thought to be 
M-theory). These theories are related by transformations that are called dualities. If two theories are related by a 
duality transformation, it means that the first theory can be transformed in some way so that it ends up looking just 
like the second theory. The two theories are then said to be dual to one another under that kind of transformation. Put 
differently, the two theories are mathematically different descriptions of the same phenomena. 

These dualities link quantities that were also thought to be separate. Large and small distance scales, as well as 
strong and weak coupling strengths, are quantities that have always marked very distinct limits of behavior of a 
physical system in both classical field theory and quantum particle physics. But strings can obscure the difference 
between large and small, strong and weak, and this is how these five very different theories end up being related. 
T-duality relates the large and small distance scales between string theories, whereas S-duality relates strong and 
weak coupling strengths between string theories. U-duality links T-duality and S-duality. 
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String theories 


Type 


Spacetime 
dimensions 


Details 


Bosonic 


26 


Only bosons, no fermions, meaning only forces, no matter, with both open and closed strings; major flaw: a particle 
with imaginary mass, called the tachyon, representing an instability in the theory. 


I 


10 


Supersymmetry between forces and matter, with both open and closed strings; no tachyon; group symmetry is 
SO(32) 


IIA 


10 


Supersymmetry between forces and matter, with only closed strings bound to D-branes; no tachyon; massless 
fermions are non-chiral 


IIB 


10 


Supersymmetry between forces and matter, with only closed strings bound to D-branes; no tachyon; massless 
fermions are chiral 


HO 


10 


Supersymmetry between forces and matter, with closed strings only; no tachyon; heterotic, meaning right moving 
and left moving strings differ; group symmetry is SO(32) 


HE 


10 


Supersymmetry between forces and matter, with closed strings only; no tachyon; heterotic, meaning right moving 
and left moving strings differ; group symmetry is E^xE^ 



Note that in the type IIA and type IIB string theories closed strings are allowed to move everywhere throughout the 
ten-dimensional spacetime (called the bulk), while open strings have their ends attached to D-branes, which are 
membranes of lower dimensionality (their dimension is odd — 1,3, 5, 7 or 9 — in type IIA and even — 0, 2, 4, 6 or 
8 — in type IIB, including the time direction). 



Extra dimensions 
Number of dimensions 

An intriguing feature of string theory is that it involves the prediction of extra dimensions. The number of 

dimensions is not fixed by any consistency criterion, but flat spacetime solutions do exist in the so-called "critical 

dimension". Cosmological solutions exist in a wider variety of dimensionalities, and these different 

dimensions — more precisely different values of the "effective central charge", a count of degrees of freedom which 

ri4i 

reduces to dimensionality in weakly curved regimes — are related by dynamical transitions. 

One such theory is the 11 -dimensional M-theory, which requires spacetime to have eleven dimensions' 15 ^ as 
opposed to the usual three spatial dimensions and the fourth dimension of time. The original string theories from the 
1980s describe special cases of M-theory where the eleventh dimension is a very small circle or a line, and if these 
formulations are considered as fundamental, then string theory requires ten dimensions. But the theory also describes 
universes like ours, with four observable spacetime dimensions, as well as universes with up to 10 flat space 
dimensions, and also cases where the position in some of the dimensions is not described by a real number, but by a 
completely different type of mathematical quantity. So the notion of spacetime dimension is not fixed in string 
theory: it is best thought of as different in different circumstances' 16 ^ 

Nothing in Maxwell's theory of electromagnetism or Einstein's theory of relativity makes this kind of prediction; 
these theories require physicists to insert the number of dimensions "by both hands", and this number is fixed and 
independent of potential energy. String theory allows one to relate the number of dimensions to scalar potential 
energy. Technically, this happens because a gauge anomaly exists for every separate number of predicted 
dimensions, and the gauge anomaly can be counteracted by including nontrivial potential energy into equations to 
solve motion. Furthermore, the absence of potential energy in the "critical dimension" explains why flat spacetime 
solutions are possible. 

This can be better understood by noting that a photon included in a consistent theory (technically, a particle carrying 
a force related to an unbroken gauge symmetry) must be massless. The mass of the photon which is predicted by 
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string theory depends on the energy of the string mode which represents the photon. This energy includes a 

contribution from the Casimir effect, namely from quantum fluctuations in the string. The size of this contribution 

depends on the number of dimensions since for a larger number of dimensions; there are more possible fluctuations 

in the string position. Therefore, the photon in flat spacetime will be massless — and the theory consistent — only for a 

ri7i 

particular number of dimensions. When the calculation is done, the critical dimensionality is not four as one may 

expect (three axes of space and one of time). The subset of X is equal to the relation of photon fluctuations in a linear 

dimension. Flat space string theories are 26-dimensional in the bosonic case, while superstring and M-theories turn 

out to involve 10 or 11 dimensions for flat solutions. In bosonic string theories, the 26 dimensions come from the 
n 8i 

Polyakov equation. Starting from any dimension greater than four, it is necessary to consider how these are 
reduced to four dimensional spacetime. 



Compact dimensions 

Two different ways have been proposed to resolve this apparent 
contradiction. The first is to compactify the extra dimensions; i.e., 
the 6 or 7 extra dimensions are so small as to be undetectable by 
present day experiments. 

To retain a high degree of supersymmetry, these compactification 
spaces must be very special, as reflected in their holonomy. A 
6-dimensional manifold must have SU(3) structure, a particular 
case (torsionless) of this being SU(3) holonomy, making it a 
Calabi-Yau space, and a 7-dimensional manifold must have G 2 
structure, with G 2 holonomy again being a specific, simple, case. 
Such spaces have been studied in attempts to relate string theory to 
the 4-dimensional Standard Model, in part due to the 
computational simplicity afforded by the assumption of 
supersymmetry. More recently, progress has been made 
constructing more realistic compactifications without the degree of symmetry of Calabi-Yau or G2 manifolds. 




Calabi-Yau manifold (3D projection) 



A standard analogy for this is to consider multidimensional space as a garden hose. If the hose is viewed from a 
sufficient distance, it appears to have only one dimension, its length. Indeed, think of a ball just small enough to 
enter the hose. Throwing such a ball inside the hose, the ball would move more or less in one dimension; in any 
experiment we make by throwing such balls in the hose, the only important movement will be one-dimensional, that 
is, along the hose. However, as one approaches the hose, one discovers that it contains a second dimension, its 
circumference. Thus, an ant crawling inside it would move in two dimensions (and a fly flying in it would move in 
three dimensions). This "extra dimension" is only visible within a relatively close range to the hose, or if one "throws 
in" small enough objects. Similarly, the extra compact dimensions are only "visible" at extremely small distances, or 
by experimenting with particles with extremely small wavelengths (of the order of the compact dimension's radius), 
which in quantum mechanics means very high energies (see wave-particle duality). 



Brane-world scenario 

Another possibility is that we are "stuck" in a 3+1 dimensional (i.e. three spatial dimensions plus the time 
dimension) subspace of the full universe. If such sub-spacetimes are exceptional sets within a larger-dimensional 
one, there typically exist properly localized matter and Yang-Mills gauge fields J 19 ^ These "exceptional sets" are 
ubiquitous in Calabi-Yau rc-folds and may be described as subspaces without local deformations, akin to a crease in a 
sheet of paper or a crack in a crystal, the neighborhood of which is markedly different from the exceptional subspace 
itself. However, until the work of Randall and SundrumJ 20 ^ it was not known that gravity too can be properly 
localized to a sub- spacetime; their proof that it can made such cosmological scenarios realistic. In addition, 
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spacetime may well be stratified, containing strata of various dimensions so that we may be inhabiting a 3+1 
dimensional stratum; such geometries occur naturally in Calabi-Yau compactifications. Such sub-spacetimes are 
supposed to be D-branes, hence such models are known as a brane- world theories. 

Effect of the hidden dimensions 

In either case, gravity acting in the hidden dimensions affects other non-gravitational forces such as 
electromagnetism. In fact, Kaluza's early work demonstrated that general relativity in five dimensions actually 
predicts the existence of electromagnetism. However, because of the nature of Calabi-Yau manifolds, no new forces 
appear from the small dimensions, but their shape has a profound effect on how the forces between the strings appear 
in our four-dimensional universe. In principle, therefore, it is possible to deduce the nature of those extra dimensions 
by requiring consistency with the standard model, but this is not yet a practical possibility. It is also possible to 
extract information regarding the hidden dimensions by precision tests of gravity, but so far these have only put 
upper limitations on the size of such hidden dimensions. 

D-branes 

Another key feature of string theory is the existence of D-branes. These are membranes of different dimensionality 
(anywhere from a zero dimensional membrane — which is in fact a point — and up, including 2-dimensional 
membranes, 3 -dimensional volumes and so on). 

D-branes are defined by the fact that worldsheet boundaries are attached to them. D-branes have mass, since they 
emit and absorb closed strings which describe gravitons, and — in superstring theories — charge as well, since they 
couple to open strings which describe gauge interactions. 

From the point of view of open strings, D-branes are objects to which the ends of open strings are attached. The open 
strings attached to a D-brane are said to "live" on it, and they give rise to gauge theories "living" on it (since one of 
the open string modes is a gauge boson such as the photon). In the case of one D-brane there will be one type of a 
gauge boson and we will have an Abelian gauge theory (with the gauge boson being the photon). If there are 
multiple parallel D-branes there will be multiple types of gauge bosons, giving rise to a non- Abelian gauge theory. 

D-branes are thus gravitational sources, on which a gauge theory "lives". This gauge theory is coupled to gravity 
(which is said to exist in the bulk), so that normally each of these two different viewpoints is incomplete. 

Gauge-gravity duality 

Gauge-gravity duality is a conjectured duality between a quantum theory of gravity in certain cases and gauge theory 
in a lower number of dimensions. This means that each predicted phenomenon and quantity in one theory has an 
analogue in the other theory, with a "dictionary" translating from one theory to the other. 

Description of the duality 

In certain cases the gauge theory on the D-branes is decoupled from the gravity living in the bulk; thus open strings 
attached to the D-branes are not interacting with closed strings. Such a situation is termed a decoupling limit. 

In those cases, the D-branes have two independent alternative descriptions. As discussed above, from the point of 
view of closed strings, the D-branes are gravitational sources, and thus we have a gravitational theory on spacetime 
with some background fields. From the point of view of open strings, the physics of the D-branes is described by the 
appropriate gauge theory. Therefore in such cases it is often conjectured that the gravitational theory on spacetime 
with the appropriate background fields is dual (i.e. physically equivalent) to the gauge theory on the boundary of this 
spacetime (since the subspace filled by the D-branes is the boundary of this spacetime). So far, this duality has not 
been proven in any cases, so there is also disagreement among string theorists regarding how strong the duality 
applies to various models. 
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Examples and intuition 

The best known example and the first one to be studied is the duality between Type IIB superstring on A dS 5 X S 5 (a 
product space of a five-dimensional Anti de Sitter space and a five-sphere) on one hand, and N = 4 supersymmetric 
Yang-Mills theory on the four-dimensional boundary of the Anti de Sitter space (either a flat four-dimensional 
spacetime R 3,1 or a three-sphere with time S 3 X R). This is known as the AdS/CFT correspondence, t22] t23] t24] t25] 
a name often used for Gauge / gravity duality in general. 

This duality can be thought of as follows: suppose there is a spacetime with a gravitational source, for example an 
extremal black holeJ 26 ^ When particles are far away from this source, they are described by closed strings (i.e. a 
gravitational theory, or usually supergravity). As the particles approach the gravitational source, they can still be 

T271 T281 T291 

described by closed strings; alternatively, they can be described by objects similar to QCD strings, 1 which 
are made of gauge bosons (gluons) and other gauge theory degrees of freedom P 0 ^ So if one is able (in a decoupling 
limit) to describe the gravitational system as two separate regions — one (the bulk) far away from the source, and the 
other close to the source — then the latter region can also be described by a gauge theory on D-branes. This latter 
region (close to the source) is termed the near-horizon limit, since usually there is an event horizon around (or at) the 
gravitational source. 

In the gravitational theory, one of the directions in spacetime is the radial direction, going from the gravitational 
source and away (towards the bulk). The gauge theory lives only on the D-brane itself, so it does not include the 
radial direction: it lives in a spacetime with one less dimension compared to the gravitational theory (in fact, it lives 
on a spacetime identical to the boundary of the near-horizon gravitational theory). Let us understand how the two 
theories are still equivalent: 

The physics of the near-horizon gravitational theory involves only on-shell states (as usual in string theory), while 
the field theory includes also off-shell correlation function. The on-shell states in the near-horizon gravitational 
theory can be thought of as describing only particles arriving from the bulk to the near-horizon region and interacting 
there between themselves. In the gauge theory these are "projected" onto the boundary, so that particles which arrive 
at the source from different directions will be seen in the gauge theory as (off-shell) quantum fluctuations far apart 
from each other, while particles arriving at the source from almost the same direction in space will be seen in the 
gauge theory as (off-shell) quantum fluctuations close to each other. Thus the angle between the arriving particles in 
the gravitational theory translates to the distance scale between quantum fluctuations in the gauge theory. The angle 
between arriving particles in the gravitational theory is related to the radial distance from the gravitational source at 
which the particles interact: the larger the angle, the closer the particles have to get to the source in order to interact 
with each other. On the other hand, the scale of the distance between quantum fluctuations in a quantum field theory 
is related (inversely) to the energy scale in this theory, so small radius in the gravitational theory translates to low 
energy scale in the gauge theory (i.e. the IR regime of the field theory) while large radius in the gravitational theory 
translates to high energy scale in the gauge theory (i.e. the UV regime of the field theory). 

A simple example to this principle is that if in the gravitational theory there is a setup in which the dilaton field 
(which determines the strength of the coupling) is decreasing with the radius, then its dual field theory will be 
asymptotically free, i.e. its coupling will grow weaker in high energies. 
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Contact with experiment 

The mathematics of string theory may lead to new insights on quantum chromodynamics, a gauge theory which is 

the fundamental theory of the strong nuclear force. To this end, it is hoped that a gravitational theory dual to 

T311 

quantum chromodynamics will be found. 

A mathematical technique from string theory (the AdS/CFT correspondence) has been used to describe qualitative 
features of quark-gluon plasma behavior in relativistic heavy-ion collisions;^ 111 ^ 81 ^ 91 ^ 101 the physics, however, is 
strictly that of standard quantum chromodynamics, which has been quantitatively modeled by lattice QCD methods 
with good results' 321 

Other potential forms of evidence for string theory have been proposed. One would be the discovery of large cosmic 
strings in space, formed when the Big Bang "stretched" some strings to astronomical proportions. Other theories, 
however, predict cosmic strings with a different physical basis (topological defects in various fields). Novel 
phenomena consistent with versions of string theory may be observed with the newly built Large Hadron Collider. 
One would be indications of an anomalously large strength of gravity on a microscopic scale, which would be 
consistent with branes interacting across extra dimensions through leakage of gravitons from one to the other. The 
discovery of supersymmetry could also be considered evidence since string theory was the first theory to require it, 
though other theories incorporate supersymmetry as well. The absence of supersymmetric particles at energies 
accessible to the LHC would not necessarily disprove string theory, since supersymmetry could exist but still be 
outside the accelerator's range. 



Problems and controversy 

Although string theory comes from physics, some say that string theory's current untestable status means that it 

T331 

should be classified as more of a mathematical framework for building models as opposed to a physical theory. 1 
Some go further, and say that string theory as a theory of everything is a failure. This led to a public debate in 

2007 l-'j with 

one journalist expressing this opinion: 

"For more than a generation, physicists have been chasing a will-o'-the-wisp called string theory. The 
beginning of this chase marked the end of what had been three-quarters of a century of progress. Dozens of 
string-theory conferences have been held, hundreds of new Ph.D.s have been minted, and thousands of papers 
have been written. Yet, for all this activity, not a single new testable prediction has been made, not a single 
theoretical puzzle has been solved. In fact, there is no theory so far — just a set of hunches and calculations 
suggesting that a theory might exist. And, even if it does, this theory will come in such a bewildering number 
of versions that it will be of no practical use: a Theory of Nothing." 

—Jim Holt [38] 



Is string theory predictive? 

String theory as a theory of everything has been criticized as unscientific because it is so difficult to test by 
experiments. The controversy concerns two properties: 

1 . It is widely believed that any theory of quantum gravity would require extremely high energies to probe directly, 
higher by orders of magnitude than those that current experiments such as the Large Hadron Collider 1 can 
reach. 

2. String theory as it is currently understood has a huge number of equally possible solutions, called string vacua, ^ 
and these vacua might be sufficiently diverse to explain almost any phenomena we might observe at lower 
energies. 

If these properties are true, string theory as a theory of everything would have little or no predictive power for low 
energy particle physics experiments.^ 411 ^ 421 Because the theory is so difficult to test, some theoretical physicists have 
asked if it can even be called a scientific theory. Notable critics include Peter Woit, Lee Smolin, Philip Warren 
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Anderson, Sheldon Glashow, Lawrence Krauss, and Carlo Rovelli. 

All string theory models are quantum mechanical, Lorentz invariant, unitary and contain Einstein's General 
Relativity as a low energy limit J 47 -' Therefore to falsify string theory, it would suffice to falsify quantum mechanics, 
Lorentz invariance, or general relativity J 48 ^ Hence string theory is falsifiable and meets the definition of scientific 
theory according to the Popperian criterion. However to constitute a convincing potential verification of string 
theory, a prediction should be specific to it, not shared by any quantum field theory model or by General Relativity. 

One such unique prediction is string harmonics: at sufficiently high energies — probably near the quantum gravity 
scale — the string-like nature of particles would become obvious. There should be heavier copies of all particles 
corresponding to higher vibrational states of the string. But it is not clear how high these energies are. In the most 
likely case, they would be 10 14 times higher than those accessible in the newest particle accelerator, the LHC, 
making this prediction impossible to test with any particle accelerator in the foreseeable future. 

Swampland 

In response to these concerns, Cumrun Vafa and others have challenged the idea that string theory is compatible with 
anything. They propose that most possible theories of low energy physics lie in the swampland. The swampland is 
the collection of theories which could be true if gravity wasn't an issue, but which are not compatible with string 
theory. An example of a theory in the swampland is quantum electrodynamics in the limit of very small electron 
charge. This limit is perfectly fine in quantum field theory — in fact, in this limit, the perturbation theory becomes 
better and better. But in string theory, at the moment the charge of the lightest charged particle becomes less than the 
mass in natural units, the theory becomes inconsistent. 

The reason is that two such charged massive particles will attract each other gravitationally more than they repel 
each other electrostatically, and could be used to form black holes. If there are no light charged particles, these black 
holes could not decay efficiently, barring improbable conspiracies or remnants. From the study of examples, and 
from the analysis of black-hole evaporation, it is now accepted that theories with a small charge quantum must come 
with light charged particles. This is only true within string theory — there is no such restriction in quantum field 
theory. This means that the discovery of a new gauge group with a small quantum of charge and only heavy charged 
particles would falsify string theory. Since this argument is very general — relying only on black-hole evaporation 
and the holographic principle, it has been suggested that this prediction would be true of any consistent holographic 
theory of quantum gravity, although the phrase "consistent holographic theory of quantum gravity" might very well 
be synonymous with "String Theory". 

It is notable that all the gross features of the Standard model can be embedded within String theory, so that the 
standard model is not in the swampland. This includes features such as non-abelian gauge groups and chiral fermions 
which are hard to incorporate in non-string theories of quantum gravity. 

Background independence 

A separate and older criticism of string theory is that it is background-dependent — string theory describes 
perturbative expansions about fixed spacetime backgrounds. Although the theory has some 
background-independence — topology change is an established process in string theory, and the exchange of 
gravitons is equivalent to a change in the background — mathematical calculations in the theory rely on preselecting 
a background as a starting point. This is because, like many quantum field theories, much of string theory is still only 
formulated perturbatively, as a divergent series of approximations. Although nonperturbative techniques have 
progressed considerably — including conjectured complete definitions in spacetimes satisfying certain asymptotics 
— a full non-perturbative definition of the theory is still lacking. Some see background independence as a 
fundamental requirement of a theory of quantum gravity, particularly since general relativity is already background 
independent. Some hope that M-theory, or a non-perturbative treatment of string theory (string field theory was 
thought to be non-perturbative in the 1980s) have a background-independent formulation. 
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Supersymmetry breaking 

A central problem for applications is that the best understood backgrounds of string theory preserve much of the 
supersymmetry of the underlying theory, which results in time-invariant spacetimes: currently string theory cannot 
deal well with time-dependent, cosmological backgrounds. However, several models have been proposed to explain 
supersymmetry breaking, most notably the KKLT model,'- 40 -' which incorporates branes and fluxes to make a 
metastable compactification. 

String theory landscape 

The vacuum structure of the theory, called the string theory landscape (or the anthropic portion of string theory 

vacua), is not well understood. String theory contains an infinite number of distinct meta- stable vacua, and perhaps 
500 

10 of these or more correspond to a universe roughly similar to ours — with four dimensions, a high planck scale, 
gauge groups, and chiral fermions. Each of these corresponds to a different possible universe, with a different 
collection of particles and forces. t40] What principle, if any, can be used to select among these vacua is an open 
issue. While there are no continuous parameters in the theory, there is a very large set of possible universes, which 
may be radically different from each other. 

Some physicists believe this is a good thing, because it may allow a natural anthropic explanation of the observed 
values of physical constants, in particular the small value of the cosmological constant J 49 ^ ^ The argument is that 
most universes contain values for physical constants which do not lead to habitable universes (at least for humans), 
and so we happen to live in the most "friendly" universe. This principle is already employed to explain the existence 
of life on earth as the result of a life-friendly orbit around the medium- sized sun among an infinite number of 
possible orbits (as well as a relatively stable location in the galaxy). However, the cosmological version of the 
anthropic principle remains highly controversial because it would be difficult if not impossible to Popper falsify; so 
many do not accept it as scientific. 

Other testability criteria 

Many physicists strongly oppose the idea that string theory is not falsifiable, among them Sylvester James Gates: 
"So, the next time someone tells you that string theory is not testable, remind them of the AdS/CFT connection 
..."^ AdS/CFT relates string theory to gauge theory, and allows contact with low energy experiments in quantum 
chromodynamics. This type of string theory, which only describes the strong interactions, is much less controversial 
today than string theories of everything (although two decades ago, it was the other way around). 

In addition, Gates points out that the grand unification natural in string theories of everything requires that the 

coupling constants of the four forces meet at one point under renormalization group rescaling. This is also a 

falsifiable statement, but it is not restricted to string theory, but is shared by grand unified theories. The LHC will 

T531 

be used both for testing AdS/CFT, and to check if the electro weakstrong unification does happen as predicted. 



History 

Some of the structures reintroduced by string theory arose for the first time much earlier as part of the program of 
classical unification started by Albert Einstein. The first person to add a fifth dimension to general relativity was 
German mathematician Theodor Kaluza in 1919, who noted that gravity in five dimensions describes both gravity 
and electromagnetism in four. In 1926, the Swedish physicist Oskar Klein gave a physical interpretation of the 
unobservable extra dimension — it is wrapped into a small circle. Einstein introduced a non-symmetric metric tensor, 
while much later Brans and Dicke added a scalar component to gravity. These ideas would be revived within string 
theory, where they are demanded by consistency conditions. 

String theory was originally developed during the late 1960s and early 1970s as a never completely successful theory 
of hadrons, the subatomic particles like the proton and neutron which feel the strong interaction. In the 1960s, 
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Geoffrey Chew and Steven Frautschi discovered that the mesons make families called Regge trajectories with 
masses related to spins in a way that was later understood by Yoichiro Nambu, Holger Bech Nielsen and Leonard 
Susskind to be the relationship expected from rotating strings. Chew advocated making a theory for the interactions 
of these trajectories which did not presume that they were composed of any fundamental particles, but would 
construct their interactions from self-consistency conditions on the S-matrix. The S-matrix approach was started by 
Werner Heisenberg in the 1940s as a way of constructing a theory which did not rely on the local notions of space 
and time, which Heisenberg believed break down at the nuclear scale. While the scale was off by many orders of 
magnitude, the approach he advocated was ideally suited for a theory of quantum gravity. 

Working with experimental data, R. Dolen, D. Horn and C. Schmid^ 54 ^ developed some sum rules for hadron 
exchange. When a particle and antiparticle scatter, virtual particles can be exchanged in two qualitatively different 
ways. In the s-channel, the two particles annihilate to make temporary intermediate states which fall apart into the 
final state particles. In the t-channel, the particles exchange intermediate states by emission and absorption. In field 
theory, the two contributions add together, one giving a continuous background contribution, the other giving peaks 
at certain energies. In the data, it was clear that the peaks were stealing from the background— the authors 
interpreted this as saying that the t-channel contribution was dual to the s-channel one, meaning both described the 
whole amplitude and included the other. 

The result was widely advertised by Murray Gell-Mann, leading Gabriele Veneziano to construct a scattering 
amplitude which had the property of Dolen-Horn-Schmid duality, later renamed world-sheet duality. The amplitude 
needed poles where the particles appear, on straight line trajectories, and there is a special mathematical function 
whose poles are evenly spaced on half the real line — the Gamma function— which was widely used in Regge 
theory. By manipulating combinations of Gamma functions, Veneziano was able to find a consistent scattering 
amplitude with poles on straight lines, with mostly positive residues, which obeyed duality and had the appropriate 
Regge scaling at high energy. The amplitude could fit near-beam scattering data as well as other Regge type fits, and 
had a suggestive integral representation which could be used for generalization. 

Over the next years, hundreds of physicists worked to complete the bootstrap program for this model, with many 
surprises. Veneziano himself discovered that for the scattering amplitude to describe the scattering of a particle 
which appears in the theory, an obvious self-consistency condition, the lightest particle must be a tachyon. Miguel 
Virasoro and Joel Shapiro found a different amplitude now understood to be that of closed strings, while Ziro Koba 
and Holger Nielsen generalized Veneziano's integral representation to multiparticle scattering. Veneziano and Sergio 
Fubini introduced an operator formalism for computing the scattering amplitudes which was a forerunner of 
world- sheet conformal theory, while Virasoro understood how to remove the poles with wrong- sign residues using a 
constraint on the states. Claud Lovelace calculated a loop amplitude, and noted that there is an inconsistency unless 
the dimension of the theory is 26. Charles Thorn, Peter Goddard and Richard Brower went on to prove that there are 
no wrong-sign propagating states in dimensions less than or equal to 26. 

In 1969 Yoichiro Nambu, Holger Bech Nielsen and Leonard Susskind recognized that the theory could be given a 
description in space and time in terms of strings. The scattering amplitudes were derived systematically from the 
action principle by Peter Goddard, Jeffrey Goldstone, Claudio Rebbi and Charles Thorn, giving a space-time picture 
to the vertex operators introduced by Veneziano and Fubini and a geometrical interpretation to the Virasoro 
conditions. 

In 1970, Pierre Ramond added fermions to the model, which led him to formulate a two-dimensional supersymmetry 
to cancel the wrong-sign states. John Schwarz and Andre Neveu added another sector to the fermi theory a short time 
later. In the fermion theories, the critical dimension was 10. Stanley Mandelstam formulated a world sheet conformal 
theory for both the bose and fermi case, giving a two-dimensional field theoretic path-integral to generate the 
operator formalism. Michio Kaku and Keiji Kikkawa gave a different formulation of the bosonic string, as a string 
field theory, with infinitely many particle types and with fields taking values not on points, but on loops and curves. 
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In 1974, Tamiaki Yoneya discovered that all the known string theories included a massless spin-two particle which 
obeyed the correct Ward identities to be a graviton. John Schwarz and Joel Scherk came to the same conclusion and 
made the bold leap to suggest that string theory was a theory of gravity, not a theory of hadrons. They reintroduced 
Kaluza-Klein theory as a way of making sense of the extra dimensions. At the same time, quantum chromodynamics 
was recognized as the correct theory of hadrons, shifting the attention of physicists and apparently leaving the 
bootstrap program in the dustbin of history. 

String theory eventually made it out of the dustbin, but for the following decade all work on the theory was 
completely ignored. Still, the theory continued to develop at a steady pace thanks the work of a handful of devotees. 
Ferdinando Gliozzi, Joel Scherk, and David Olive realized in 1976 that the original Ramond and Neveu 
Schwarz-strings were separately inconsistent and needed to be combined. The resulting theory did not have a 
tachyon, and was proven to have space-time supersymmetry by John Schwarz and Michael Green in 1981. The same 
year, Alexander Polyakov gave the theory a modern path integral formulation, and went on to develop conformal 
field theory extensively. In 1979, Daniel Friedan showed that the equations of motions of string theory, which are 
generalizations of the Einstein equations of General Relativity, emerge from the Renormalization group equations 
for the two-dimensional field theory. Schwarz and Green discovered T-duality, and constructed two different 
superstring theories — IIA and IIB related by T-duality, and type I theories with open strings. The consistency 
conditions had been so strong, that the entire theory was nearly uniquely determined, with only a few discrete 
choices. 

In the early 1980s, Edward Witten discovered that most theories of quantum gravity could not accommodate chiral 
fermions like the neutrino. This led him, in collaboration with Luis Alvarez-Gaume to study violations of the 
conservation laws in gravity theories with anomalies, concluding that type I string theories were inconsistent. Green 
and Schwarz discovered a contribution to the anomaly that Witten and Alvarez-Gaume had missed, which restricted 
the gauge group of the type I string theory to be SO(32). In coming to understand this calculation, Edward Witten 
became convinced that string theory was truly a consistent theory of gravity, and he became a high-profile advocate. 
Following Witten's lead, between 1984 and 1986, hundreds of physicists started to work in this field, and this is 
sometimes called the first superstring revolution. 

During this period, David Gross, Jeffrey Harvey, Emil Martinec, and Ryan Rohm discovered heterotic strings. The 
gauge group of these closed strings was two copies of E8, and either copy could easily and naturally include the 
standard model. Philip Candelas, Gary Horowitz, Andrew Strominger and Edward Witten found that the Calabi-Yau 
manifolds are the compactifications which preserve a realistic amount of supersymmetry, while Lance Dixon and 
others worked out the physical properties of orbifolds, distinctive geometrical singularities allowed in string theory. 
Cumrun Vafa generalized T-duality from circles to arbitrary manifolds, creating the mathematical field of mirror 
symmetry. David Gross and Vipul Periwal discovered that string perturbation theory was divergent in a way that 
suggested that new non-perturbative objects were missing. 

In the 1990s, Joseph Polchinski discovered that the theory requires higher-dimensional objects, called D-branes and 
identified these with the black-hole solutions of supergravity. These were understood to be the new objects suggested 
by the perturbative divergences, and they opened up a new field with rich mathematical structure. It quickly became 
clear that D-branes and other p-branes, not just strings, formed the matter content of the string theories, and the 
physical interpretation of the strings and branes was revealed— they are a type of black hole. Leonard Susskind had 
incorporated the holographic principle of Gerardus 't Hooft into string theory, identifying the long highly-excited 
string states with ordinary thermal black hole states. As suggested by 't Hooft, the fluctuations of the black hole 
horizon, the world-sheet or world- volume theory, describes not only the degrees of freedom of the black hole, but all 
nearby objects too. 

In 1995, at the annual conference of string theorists at the University of Southern California (USC), Edward Witten 
gave a speech on string theory that essentially united the five string theories that existed at the time, and giving birth 
to a new 11 -dimensional theory called M-theory. M-theory was also foreshadowed in the work of Paul Townsend at 
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approximately the same time. The flurry of activity which began at this time is sometimes called the second 
superstring revolution. 

During this period, Tom Banks, Willy Fischler Stephen Shenker and Leonard Susskind formulated a full holographic 
description of M-theory on IIA DO branes, ^ the first definition of string theory that was fully non-perturbative and 
a concrete mathematical realization of the holographic principle. Andrew Strominger and Cumrun Vafa calculated 
the entropy of certain configurations of D-branes and found agreement with the semi-classical answer for extreme 
charged black holes. Petr Horava and Edward Witten found the eleven-dimensional formulation of the heterotic 
string theories, showing that orbifolds solve the chirality problem. Witten noted that the effective description of the 
physics of D-branes at low energies is by a supersymmetric gauge theory, and found geometrical interpretations of 
mathematical structures in gauge theory that he and Nathan Seiberg had earlier discovered in terms of the location of 
the branes. 

In 1997 Juan Maldacena noted that the low energy excitations of a theory near a black hole consist of objects close to 
the horizon, which for extreme charged black holes looks like an anti de Sitter space. He noted that in this limit the 
gauge theory describes the string excitations near the branes. So he hypothesized that string theory on a near-horizon 
extreme-charged black-hole geometry, an anti-deSitter space times a sphere with flux, is equally well described by 
the low-energy limiting gauge theory, the N=4 supersymmetric Yang-Mills theory. This hypothesis, which is called 
the AdS/CFT correspondence, was further developed by Steven Gubser, Igor Klebanov and Alexander Polyakov, 
and by Edward Witten, and it is now well-accepted. It is a concrete realization of the holographic principle, which 
has far-reaching implications for black holes, locality and information in physics, as well as the nature of the 
gravitational interaction. Through this relationship, string theory has been shown to be related to gauge theories like 
quantum chromodynamics and this has led to more quantitative understanding of the behavior of hadrons, bringing 
string theory back to its roots. 

See also 

• List of string theory topics 

• Conformal field theory 

• F-theory 

• Fuzzballs 

• Little string theory 

• Loop quantum gravity 

• Relationship between string theory and quantum field theory 

• String cosmology 

• Supergravity 

• Zeta function regularization 

• The Elegant Universe 
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Superstring Theories 

Superstring theory is an attempt to explain all of the particles and fundamental forces of nature in one theory by 
modelling them as vibrations of tiny supersymmetric strings. Superstring theory is a shorthand for supersymmetric 
string theory because unlike bosonic string theory, it is the version of string theory that incorporates fermions and 
supersymmetry. So far the theory has made no quantitative experimental predictions, so that the theory could be 
falsified (tested) [1] [2] . 

Background 

The deepest problem in theoretical physics is harmonizing the theory of general relativity, which describes 
gravitation and applies to large-scale structures (stars, galaxies, super clusters), with quantum mechanics, which 
describes the other three fundamental forces acting on the atomic scale. 

The development of a quantum field theory of a force invariably results in infinite (and therefore useless) 
probabilities. Physicists have developed mathematical techniques (renormalization) to eliminate these infinities 
which work for three of the four fundamental forces - electromagnetic, strong nuclear and weak nuclear forces - but 
not for gravity. The development of a quantum theory of gravity must therefore come about by different means than 
those used for the other forces. 

Basic idea 

-33 

The basic idea is that the fundamental constituents of reality are strings of the Planck length (about 10 cm) which 
vibrate at resonant frequencies. Every string in theory has a unique resonance, or harmonic. Different harmonics 
determine different fundamental forces. The tension in a string is on the order of the Planck force (10 44 newtons). 
The graviton (the proposed messenger particle of the gravitational force), for example, is predicted by the theory to 
be a string with wave amplitude zero. Another key insight provided by the theory is that no measurable differences 
can be detected between strings that wrap around dimensions smaller than themselves and those that move along 
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larger dimensions (i.e., effects in a dimension of size R equal those whose size is l/R). Singularities are avoided 
because the observed consequences of "Big Crunches" never reach zero size. In fact, should the universe begin a 
"big crunch" sort of process, string theory dictates that the universe could never be smaller than the size of a string, 
at which point it would actually begin expanding. 

Extra dimensions 

See also: Why does consistency require 10 dimensions? 

Our physical space is observed to have only three large dimensions and — taken together with duration as the fourth 
dimension — a physical theory must take this into account. However, nothing prevents a theory from including more 
than 4 dimensions. In the case of string theory, consistency requires spacetime to have 10 (3+1+6) dimensions. The 
conflict between observation and theory is resolved by making the unobserved dimensions compactified. 

Our minds have difficulty visualizing higher dimensions because we can only move in three spatial dimensions. One 
way of dealing with this limitation is not to try to visualize higher dimensions at all, but just to think of them as extra 
numbers in the equations that describe the way the world works. This opens the question of whether these 'extra 
numbers' can be investigated directly in any experiment (which must show different results in 1 , 2, or 2 + 1 
dimensions to a human scientist). This, in turn, raises the question of whether models that rely on such abstract 
modelling (and potentially impossibly huge experimental apparatuses) can be considered scientific. Six-dimensional 
Calabi-Yau shapes can account for the additional dimensions required by superstring theory. The theory states that 
every point in space (or whatever we had previously considered a point) is in fact a very small manifold where each 
extra dimension has a size on the order of the Planck length. 

Superstring theory is not the first theory to propose extra spatial dimensions; the Kaluza- Klein theory had done so 
previously. Modern string theory relies on the mathematics of folds, knots, and topology, which were largely 
developed after Kaluza and Klein, and has made physical theories relying on extra dimensions much more credible. 

Number of superstring theories 

Theoretical physicists were troubled by the existence of five separate string theories. A possible solution for this 
dilemma was suggested at the beginning of what is called the second superstring revolution in the 1990s, which 
suggests that the five string theories might be different limits of a single underlying theory, called M-theory. 
Unfortunately, however, to this date this remains a conjecture. 
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Bosonic (closed) 
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no 
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no 
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no 



The five consistent superstring theories are: 
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• The type I string has one supersymmetry in the ten-dimensional sense (16 supercharges). This theory is special in 
the sense that it is based on unoriented open and closed strings, while the rest are based on oriented closed strings. 

• The type II string theories have two supersymmetries in the ten-dimensional sense (32 supercharges). There are 
actually two kinds of type II strings called type IIA and type IIB. They differ mainly in the fact that the IIA theory 
is non-chiral (parity conserving) while the IIB theory is chiral (parity violating). 

• The heterotic string theories are based on a peculiar hybrid of a type I superstring and a bosonic string. There are 
two kinds of heterotic strings differing in their ten-dimensional gauge groups: the heterotic E 0 xE 0 string and the 

o o 

heterotic SO(32) string. (The name heterotic SO(32) is slightly inaccurate since among the SO(32) Lie groups, 
string theory singles out a quotient Spin(32)/Z 2 that is not equivalent to SO(32).) 

Chiral gauge theories can be inconsistent due to anomalies. This happens when certain one-loop Feynman diagrams 
cause a quantum mechanical breakdown of the gauge symmetry. The anomalies were canceled out via the 
Green-Schwarz mechanism. 

Please note that the number of superstring theories given above is only a high-level classification; the actual number 
of mathematically distinct theories which are compatible with observation and would therefore have to be examined 
to find the one that correctly describes nature is currently believed to be at least 10 500 (a one with five hundred 
zeroes). This has given rise to the concern that superstring theories, despite the alluring simplicity of their basic 
principles, are, in fact, not simple at all, and according to the principle of Occam's razor perhaps alternative physical 
theories going beyond the Standard Model should be explored. This is aggravated by the fact that it is exceedingly 
hard to make predictions from any superstring theory which can be falsified by experiment, and in fact no current 
superstring theory makes any falsifiable prediction. 

Integrating general relativity and quantum mechanics 

General relativity typically deals with situations involving large mass objects in fairly large regions of spacetime 
whereas quantum mechanics is generally reserved for scenarios at the atomic scale (small spacetime regions). The 
two are very rarely used together, and the most common case in which they are combined is in the study of black 
holes. Having "peak density", or the maximum amount of matter possible in a space, and very small area, the two 
must be used in synchrony in order to predict conditions in such places; yet, when used together, the equations fall 
apart, spitting out impossible answers, such as imaginary distances and less than one dimension. 

The major problem with their congruence is that, at Planck scale (a fundamental small unit of length) lengths, 
general relativity predicts a smooth, flowing surface, while quantum mechanics predicts a random, warped surface, 
neither of which are anywhere near compatible. Superstring theory resolves this issue, replacing the classical idea of 
point particles with loops. These loops have an average diameter of the Planck length, with extremely small 
variances, which completely ignores the quantum mechanical predictions of Planck-scale length dimensional 
warping. 
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The five superstring interactions 

There are five ways open and closed strings can interact. An 
interaction in superstring theory is a topology changing event. Since 
superstring theory has to be a local theory to obey causality the 
topology change must only occur at a single point. If C represents a 
closed string and O an open string, then the five interactions are OOO, 
CCC, 0000, CO and COO. 

All open superstring theories also contain closed superstrings since 

closed superstrings can be seen from the fifth interaction, and they are 

unavoidable. Although all these interactions are possible, in practice 

The five superstring interactions 

the most used superstring model is the closed heterotic E8xE8 
superstring which only has closed strings and so only the second 
interaction (CCC) is needed. 

The mathematics 

The single most important equation in (first quantisized bosonic) string theory is the N-point scattering amplitude. 
This treats the incoming and outgoing strings as points, which in string theory are tachyons, with momentum k. 
which connect to a string world surface at the surface points z.. It is given by the following functional integral which 
integrates (sums) over all possible embeddings of this 2D surface in 27 dimensions. 

A N = J Dfij D[X] exp ( - 4^ J dzX^z, z)d^ x ^(z,z) dz 2 fc^A^Z;, z { ) 

The functional integral can be done because it is a Gaussian to become: 

a n = f Dp n \ Zi - Zj \ 2 ^ 

o<;<j<iv+i 

This is integrated over the various points z.. Special care must be taken because two parts of this complex region may 
represent the same point on the 2D surface and you don't want to integrate over them twice. Also you need to make 
sure you are not integrating multiple times over different parameterizations of the surface. When this is taken into 
account it can be used to calculate the 4-point scattering amplitude (the 3 -point amplitude is simply a delta function): 

r(-i + \{k! + fc 2 ) 2 )r(-i + |(fc 2 + fc 3 ) 2 ) 
4 r(-2 + !((*! + k 2 y + (k 2 + fc 3 ) 2 )) 

Which is a beta function. It was this beta function which was apparently found before full string theory was 
developed. With superstrings the equations contain not only the 10D space-time coordinates X but also the 
Grassmann coordinates 6. Since there are various ways this can be done this leads to different string theories. 

When integrating over surfaces such as the torus, we end up with equations in terms of theta functions and elliptic 
functions such as the Dedekind eta function. This is smooth everywhere, which it has to be to make physical sense, 
only when raised to the 24th power. This is the origin of needing 26 dimensions of space-time for bosonic string 
theory. The extra two dimensions arise as degrees of freedom of the string surface. 
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D-branes 

D-branes are membrane-like objects in 10D string theory. They can be thought of as occurring as a result of a 
Kaluza- Klein compactification of 11D M-Theory which contains membranes. Because compactification of a 
geometric theory produces extra vector fields the D-branes can be included in the action by adding an extra U(l) 
vector field to the string action. 

d z -> d z + iA z (z,z) 

In type I open string theory, the ends of open strings are always attached to D-brane surfaces. A string theory with 
more gauge fields such as SU(2) gauge fields would then correspond to the compactification of some higher 
dimensional theory above 1 1 dimensions which is not thought to be possible to date. 

Why five superstring theories? 

For a 10 dimensional supersymmetric theory we are allowed a 32-component Majorana spinor. This can be 
decomposed into a pair of 16-component Majorana- Weyl (chiral) spinors. There are then various ways to construct 
an invariant depending on whether these two spinors have the same or opposite chiralities: 



Superstring model 


Invariant 


Heterotic 




IIA 
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IIB 
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The heterotic superstrings come in two types SO(32) and E 0 xE 0 as indicated above and the type I superstrings 

o o 

include open strings. 



Beyond superstring theory 

It is commonly believed that the five superstring theories are approximated to a theory in higher dimensions possibly 
involving membranes. Unfortunately because the action for this involves quartic terms and higher so is not Gaussian 
the functional integrals are very difficult to solve and so this has confounded the top theoretical physicists. Edward 
Witten has popularised the concept of a theory in 1 1 dimensions M-Theory involving membranes interpolating from 
the known symmetries of superstring theory. It may turn out that there exist membrane models or other 
non-membrane models in higher dimensions which may become acceptable when new unknown symmetries of 
nature are found, such as noncommutative geometry for example. It is thought, however, that 16 is probably the 
maximum since 0(16) is a maximal subgroup of E8 the largest exceptional lie group and also is more than large 
enough to contain the Standard Model. Quartic integrals of the non-functional kind are easier to solve so there is 
hope for the future. This is the series solution which is always convergent when a is non-zero and negative: 

/expfax 4 + bx 3 + cx 2 + dx + f) dx — - — — — - — — — 
-°° ^ /; JZ* (4»)! (2m)! (4p)l 

In the case of membranes the series would correspond to sums of various membrane interactions that are not seen in 
string theory. 

Compactification 

Investigating theories of higher dimensions often involves looking at the 10 dimensional superstring theory and 
interpreting some of the more obscure results in terms of compactified dimensions. For example D-branes are seen 
as compactified membranes from 11D M-Theory. Theories of higher dimensions such as 12D F-theory and beyond 
will produce other effects such as gauge terms higher than U(l). The components of the extra vector fields (A) in the 
D-brane actions can be thought of as extra coordinates (X) in disguise. However, the known symmetries including 
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supersymmetry currently restrict the spinors to have 32-components which limits the number of dimensions to 11 (or 
12 if you include two time dimensions.) Some commentators (e.g. John Baez et al.) have speculated that the 
exceptional lie groups E,, E_ and E_ having maximum orthogonal subgroups O(10), 0(12) and 0(16) may be related 
to theories in 10, 12 and 16 dimensions; 10 dimensions corresponding to string theory and the 12 and 16 dimensional 
theories being yet undiscovered but would be theories based on 3-branes and 7-branes respectively. However this is 
a minority view within the string community. Since E_ is in some sense F 4 quaternified and E g is F 4 octonified, then 
the 12 and 16 dimensional theories, if they did exist, may involve the noncommutative geometry based on the 
quaternions and octonions respectively. From the above discussion, it can be seen that physicists have many ideas for 
extending superstring theory beyond the current 10 dimensional theory, but so far none have been successful. 

Kac-Moody algebras 

Since strings can have an infinite number of modes, the symmetry used to describe string theory is based on infinite 
dimensional Lie algebras. Some Kac-Moody algebras that have been considered as symmetries for M-Theory have 
been E and E^ and their supersymmetric extensions. 
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Group Representations and Symmetry 



Symmetry 



Symmetry (from the Greek: " aujijiexpeiv" = to 
measure together), generally conveys two primary 
meanings. The first is an imprecise sense of 
harmonious or aesthetically pleasing proportionality 
and balance; ^ ^ such that it reflects beauty or 
perfection. The second meaning is a precise and 
well-defined concept of balance or "patterned 
self- similarity" that can be demonstrated or proved 
according to the rules of a formal system: by geometry, 
through physics or otherwise. 



<3 






SYMMETRIC ASYMMETRIC 



Although the meanings are distinguishable in some 
contexts, both meanings of "symmetry" are related and 
discussed in parallel.^ ^ 

The "precise" notions of symmetry have various 
measures and operational definitions. For example, 
symmetry may be observed: 

• with respect to the passage of time; 

• as a spatial relationship; 

• through geometric transformations such as scaling, 
reflection, and rotation; 

• through other kinds of functional transformations;^ 
and 

• as an aspect of abstract objects, theoretic models, 
language, music and even knowledge itself ^ 

This article describes these notions of symmetry from 
four perspectives. The first is that of symmetry in 
geometry, which is the most familiar type of symmetry for many people. The second perspective is the more general 
meaning of symmetry in mathematics as a whole. The third perspective describes symmetry as it relates to science 
and technology. In this context, symmetries underlie some of the most profound results found in modern physics, 
including aspects of space and time. Finally, a fourth perspective discusses symmetry in the humanities, covering its 
rich and varied use in history, architecture, art, and religion. 
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Sphere symmetrical group o. 



The opposite of symmetry is asymmetry. 
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Symmetry in geometry 




The most familiar type of symmetry for many people is 
geometrical symmetry. Formally, this means symmetry under a 
sub-group of the Euclidean group of isometries in two or three 
dimensional Euclidean space. These isometries consist of 
reflections, rotations, translations and combinations of these basic 
operations. 




Reflection symmetry 



\ 



\ 



Reflection symmetry, mirror symmetry, mirror-image symmetry, 
or bilateral symmetry is symmetry with respect to reflection. 

In ID, there is a point of symmetry. In 2D there is an axis of 
symmetry, in 3D a plane of symmetry. An object or figure which 
is indistinguishable from its transformed image is called mirror 
symmetric (see mirror image). 



The axis of symmetry of a two-dimensional figure is a line such that, if a perpendicular is constructed, any two 
points lying on the perpendicular at equal distances from the axis of symmetry are identical. Another way to think 
about it is that if the shape were to be folded in half over the axis, the two halves would be identical: the two halves 
are each other's mirror image. Thus a square has four axes of symmetry, because there are four different ways to fold 
it and have the edges all match. A circle has infinitely many axes of symmetry, for the same reason. 

If the letter T is reflected along a vertical axis, it appears the same. Note that this is sometimes called horizontal 
symmetry, and sometimes vertical symmetry. One can better use an unambiguous formulation, e.g. M T has a vertical 
symmetry axis" or "T has left-right symmetry." 

The triangles with this symmetry are isosceles, the quadrilaterals with this symmetry are the kites and the isosceles 
trapezoids. 

For each line or plane of reflection, the symmetry group is isomorphic with Cs (see point groups in three 
dimensions), one of the three types of order two (involutions), hence algebraically C2. The fundamental domain is a 
half-plane or half- space. 

Bilateria (bilateral animals, including humans) are more or less symmetric with respect to the sagittal plane. 

In certain contexts there is rotational symmetry anyway. Then mirror-image symmetry is equivalent with inversion 
symmetry; in such contexts in modern physics the term P-symmetry is used for both (P stands for parity). 

For more general types of reflection there are corresponding more general types of reflection symmetry. Examples: 

• with respect to a non-isometric affine involution (an oblique reflection in a line, plane, etc.). 

• with respect to circle inversion 



Leonardo da Vinci's Vitruvian Man (ca. 1487) is often 
used as a representation of symmetry in the human 
body and, by extension, the natural universe. 
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Rotational symmetry 

Rotational symmetry is symmetry with respect to some or all rotations in m-dimensional Euclidean space. Rotations 
are direct isometries, i.e., isometries preserving orientation. Therefore a symmetry group of rotational symmetry is a 
subgroup of E + (m). 

Symmetry with respect to all rotations about all points implies translational symmetry with respect to all translations, 
and the symmetry group is the whole E + (m). This does not apply for objects because it makes space homogeneous, 
but it may apply for physical laws. 

For symmetry with respect to rotations about a point we can take that point as origin. These rotations form the 
special orthogonal group SO(m), the group of mxm orthogonal matrices with determinant 1. For m=3 this is the 
rotation group. 

In another meaning of the word, the rotation group of an object is the symmetry group within £ + (n), the group of 
direct isometries; in other words, the intersection of the full symmetry group and the group of direct isometries. For 
chiral objects it is the same as the full symmetry group. 

Laws of physics are SO(3)-invariant if they do not distinguish different directions in space. Because of Noether's 
theorem, rotational symmetry of a physical system is equivalent to the angular momentum conservation law. See 
also rotational in variance. 

Translational symmetry 

Translational symmetry leaves an object invariant under a discrete or continuous group of translations 
T a(p) =p + a. 

Glide reflection symmetry 

A glide reflection symmetry (in 3D: a glide plane symmetry) means that a reflection in a line or plane combined with 
a translation along the line / in the plane, results in the same object. It implies translational symmetry with twice the 
translation vector. 

The symmetry group is isomorphic with Z. 
Rotoreflection symmetry 

In 3D, rotoreflection or improper rotation in the strict sense is rotation about an axis, combined with reflection in a 
plane perpendicular to that axis. As symmetry groups with regard to a roto-reflection we can distinguish: 

• the angle has no common divisor with 360°, the symmetry group is not discrete 

• 2?z-fold rotoreflection (angle of 1 80°M) with symmetry group S of order 2n (not to be confused with symmetric 
groups, for which the same notation is used; abstract group a special case is n=l, inversion, because it does 
not depend on the axis and the plane, it is characterized by just the point of inversion. 

• C (angle of 3607ft); for odd n this is generated by a single symmetry, and the abstract group is C , for even n 
this is not a basic symmetry but a combination. See also point groups in three dimensions. 

Helical symmetry 

Helical symmetry is the kind of 
symmetry seen in such everyday objects 
as springs, Slinky toys, drill bits, and 
augers. It can be thought of as rotational 

A drill bit with helical symmetry. 
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symmetry along with translation along the axis of rotation, the screw axis). The concept of helical symmetry can be 
visualized as the tracing in three-dimensional space that results from rotating an object at an even angular speed 
while simultaneously moving at another even speed along its axis of rotation (translation). At any one point in time, 
these two motions combine to give a coiling angle that helps define the properties of the tracing. When the tracing 
object rotates quickly and translates slowly, the coiling angle will be close to 0°. Conversely, if the rotation is slow 
and the translation is speedy, the coiling angle will approach 90°. 

Three main classes of helical symmetry can be distinguished based on the interplay of the angle of coiling and 
translation symmetries along the axis: 

• Infinite helical symmetry. If there are no distinguishing features along the length of a helix or helix-like object, 
the object will have infinite symmetry much like that of a circle, but with the additional requirement of translation 
along the long axis of the object to return it to its original appearance. A helix-like object is one that has at every 
point the regular angle of coiling of a helix, but which can also have a cross section of indefinitely high 
complexity, provided only that precisely the same cross section exists (usually after a rotation) at every point 
along the length of the object. Simple examples include evenly coiled springs, slinkies, drill bits, and augers. 
Stated more precisely, an object has infinite helical symmetries if for any small rotation of the object around its 
central axis there exists a point nearby (the translation distance) on that axis at which the object will appear 
exactly as it did before. It is this infinite helical symmetry that gives rise to the curious illusion of movement 
along the length of an auger or screw bit that is being rotated. It also provides the mechanically useful ability of 
such devices to move materials along their length, provided that they are combined with a force such as gravity or 
friction that allows the materials to resist simply rotating along with the drill or auger. 

• n-fold helical symmetry. If the requirement that every cross section of the helical object be identical is relaxed, 
additional lesser helical symmetries become possible. For example, the cross section of the helical object may 
change, but still repeats itself in a regular fashion along the axis of the helical object. Consequently, objects of this 
type will exhibit a symmetry after a rotation by some fixed angle Q and a translation by some fixed distance, but 
will not in general be invariant for any rotation angle. If the angle (rotation) at which the symmetry occurs divides 
evenly into a full circle (360°), the result is the helical equivalent of a regular polygon. This case is called n-fold 
helical symmetry, where n = 360°/ Q , see e.g. double helix. This concept can be further generalized to include 
cases where rnO is a multiple of 360° — that is, the cycle does eventually repeat, but only after more than one full 
rotation of the helical object. 

• Non-repeating helical symmetry. This is the case in which the angle of rotation Q required to observe the 
symmetry is irrational. The angle of rotation never repeats exactly no matter how many times the helix is rotated. 
Such symmetries are created by using a non-repeating point group in two dimensions. DNA is an example of this 
type of non-repeating helical symmetry. 

Non-isometric symmetries 

A wider definition of geometric symmetry allows operations from a larger group than the Euclidean group of 
isometries. Examples of larger geometric symmetry groups are: 

• The group of similarity transformations, i.e. affine transformations represented by a matrix A that is a scalar times 
an orthogonal matrix. Thus dilations are added, self-similarity is considered a symmetry. 

• The group of affine transformations represented by a matrix A with determinant 1 or -1, i.e. the transformations 
which preserve area; this adds e.g. oblique reflection symmetry. 

• The group of all bijective affine transformations. 

• The group of Mobius transformations which preserve cross-ratios. 

In Felix Klein's Erlangen program, each possible group of symmetries defines a geometry in which objects that are 
related by a member of the symmetry group are considered to be equivalent. For example, the Euclidean group 
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defines Euclidean geometry, whereas the group of Mobius transformations defines projective geometry. 
Scale symmetry and fractals 

Scale symmetry refers to the idea that if an object is expanded or reduced in size, the new object has the same 
properties as the original. Scale symmetry is notable for the fact that it does not exist for most physical systems, a 
point that was first discerned by Galileo. Simple examples of the lack of scale symmetry in the physical world 
include the difference in the strength and size of the legs of elephants versus mice, and the observation that if a 
candle made of soft wax was enlarged to the size of a tall tree, it would immediately collapse under its own weight. 

A more subtle form of scale symmetry is demonstrated by fractals. As conceived by Benoit Mandelbrot, fractals are 
a mathematical concept in which the structure of a complex form looks similar or even exactly the same no matter 
what degree of magnification is used to examine it. A coast is an example of a naturally occurring fractal, since it 
retains roughly comparable and similar-appearing complexity at every level from the view of a satellite to a 
microscopic examination of how the water laps up against individual grains of sand. The branching of trees, which 
enables children to use small twigs as stand-ins for full trees in dioramas, is another example. 

This similarity to naturally occurring phenomena provides fractals with an everyday familiarity not typically seen 
with mathematically generated functions. As a consequence, they can produce strikingly beautiful results such as the 
Mandelbrot set. Intriguingly, fractals have also found a place in CG, or computer-generated movie effects, where 
their ability to create very complex curves with fractal symmetries results in more realistic virtual worlds. 

Symmetry in mathematics 

In formal terms, we say that a mathematical object is symmetric with respect to a given mathematical operation, if, 
when applied to the object, this operation preserves some property of the object. The set of operations that preserve a 
given property of the object form a group. Two objects are symmetric to each other with respect to a given group of 
operations if one is obtained from the other by some of the operations (and vice versa). 

Mathematical model for symmetry 

The set of all symmetry operations considered, on all objects in a set X, can be modeled as a group action g : G x X 
—> X, where the image of g in G and x in X is written as g-x. If, for some g, g x = y then x and y are said to be 
symmetrical to each other. For each object x, operations g for which g-x = x form a group, the symmetry group of 
the object, a subgroup of G. If the symmetry group of x is the trivial group then x is said to be asymmetric, 
otherwise symmetric. A general example is that G is a group of bijections g\ V —> V acting on the set of functions x: 
V —> Why (gx)(v)=x(g~ J (v)) (or a restricted set of such functions that is closed under the group action). Thus a group 
of bijections of space induces a group action on "objects" in it. The symmetry group of x consists of all g for which 
x(v)=x(g(v)) for all v. G is the symmetry group of the space itself, and of any object that is uniform throughout space. 
Some subgroups of G may not be the symmetry group of any object. For example, if the group contains for every v 
and w in V a g such that g(v)=w, then only the symmetry groups of constant functions x contain that group. However, 
the symmetry group of constant functions is G itself. 

In a modified version for vector fields, we have (gx)(v)=h(g,x(g~ J (v))) where h rotates any vectors and 
pseudovectors in x, and inverts any vectors (but not pseudovectors) according to rotation and inversion in g, see 
symmetry in physics. The symmetry group of x consists of all g for which x(v)=h(g,x(g(v))) for all v. In this case the 
symmetry group of a constant function may be a proper subgroup of G: a constant vector has only rotational 
symmetry with respect to rotation about an axis if that axis is in the direction of the vector, and only inversion 
symmetry if it is zero. 

For a common notion of symmetry in Euclidean space, G is the Euclidean group E(n), the group of isometries, and V 
is the Euclidean space. The rotation group of an object is the symmetry group if G is restricted to E + (n), the group 
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of direct isometries. (For generalizations, see the next subsection.) Objects can be modeled as functions x, of which a 
value may represent a selection of properties such as color, density, chemical composition, etc. Depending on the 
selection we consider just symmetries of sets of points (x is just a Boolean function of position v), or, at the other 
extreme, e.g. symmetry of right and left hand with all their structure. 

For a given symmetry group, the properties of part of the object, fully define the whole object. Considering points 
equivalent which, due to the symmetry, have the same properties, the equivalence classes are the orbits of the group 
action on the space itself. We need the value of x at one point in every orbit to define the full object. A set of such 
representatives forms a fundamental domain. The smallest fundamental domain does not have a symmetry; in this 
sense, one can say that symmetry relies upon asymmetry. 

An object with a desired symmetry can be produced by choosing for every orbit a single function value. Starting 
from a given object x we can e.g.: 

• take the values in a fundamental domain (i.e., add copies of the object) 

• take for each orbit some kind of average or sum of the values of x at the points of the orbit (ditto, where the copies 
may overlap) 

If it is desired to have no more symmetry than that in the symmetry group, then the object to be copied should be 
asymmetric. 

As pointed out above, some groups of isometries are not the symmetry group of any object, except in the modified 
model for vector fields. For example, this applies in ID for the group of all translations. The fundamental domain is 
only one point, so we can not make it asymmetric, so any "pattern" invariant under translation is also invariant under 
reflection (these are the uniform "patterns"). 

In the vector field version continuous translational symmetry does not imply reflectional symmetry: the function 
value is constant, but if it contains nonzero vectors, there is no reflectional symmetry. If there is also reflectional 
symmetry, the constant function value contains no nonzero vectors, but it may contain nonzero pseudo vectors. A 
corresponding 3D example is an infinite cylinder with a current perpendicular to the axis; the magnetic field (a 
pseudovector) is, in the direction of the cylinder, constant, but nonzero. For vectors (in particular the current density) 
we have symmetry in every plane perpendicular to the cylinder, as well as cylindrical symmetry. This cylindrical 
symmetry without mirror planes through the axis is also only possible in the vector field version of the symmetry 
concept. A similar example is a cylinder rotating about its axis, where magnetic field and current density are 
replaced by angular momentum and velocity, respectively. 

A symmetry group is said to act transitively on a repeated feature of an object if, for every pair of occurrences of the 
feature there is a symmetry operation mapping the first to the second. For example, in ID, the symmetry group of 
{...,1,2,5,6,9,10,13,14,...} acts transitively on all these points, while {...,1,2,3,5,6,7,9,10,11,13,14,15,...} does not act 
transitively on all points. Equivalently, the first set is only one conjugacy class with respect to isometries, while the 
second has two classes. 
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Symmetric functions 

A symmetric function is a function which is unchanged by any permutation of its variables. For example, x + y + z 
and xy + yz + xz are symmetric functions, whereas x - yz is not. 

A function may be unchanged by a sub-group of all the permutations of its variables. For example, ac + 3ab + be is 
unchanged if a and b are exchanged; its symmetry group is isomorphic to C^. 

Symmetry in logic 

A dyadic relation R is symmetric if and only if, whenever it's true that Rab, it's true that Rba. Thus, "is the same age 
as" is symmetrical, for if Paul is the same age as Mary, then Mary is the same age as Paul. 

Symmetric binary logical connectives are "and" (a, A , or &), "or" (v), "biconditional" (if and only if) (^), NAND 
("not-and"), XOR ("not-biconditional"), and NOR ("not-or"). 

Symmetry in science 
Symmetry in physics 

Symmetry in physics has been generalized to mean invariance — that is, lack of any visible change — under any kind 
of transformation, for example arbitrary coordinate transformations. This concept has become one of the most 
powerful tools of theoretical physics, as it has become evident that practically all laws of nature originate in 
symmetries. In fact, this role inspired the Nobel laureate PW Anderson to write in his widely read 1972 article More 
is Different that "it is only slightly overstating the case to say that physics is the study of symmetry." See Noether's 
theorem (which, in greatly simplified form, states that for every continuous mathematical symmetry, there is a 
corresponding conserved quantity; a conserved current, in Noether's original language); and also, Wigner's 
classification, which says that the symmetries of the laws of physics determine the properties of the particles found 
in nature. 

Symmetry in physical objects 
Classical objects 

Although an everyday object may appear exactly the same after a symmetry operation such as a rotation or an 
exchange of two identical parts has been performed on it, it is readily apparent that such a symmetry is true only as 
an approximation for any ordinary physical object. 

For example, if one rotates a precisely machined aluminum equilateral triangle 120 degrees around its center, a 
casual observer brought in before and after the rotation will be unable to decide whether or not such a rotation took 
place. However, the reality is that each corner of a triangle will always appear unique when examined with sufficient 
precision. An observer armed with sufficiently detailed measuring equipment such as optical or electron microscopes 
will not be fooled; he will immediately recognize that the object has been rotated by looking for details such as 
crystals or minor deformities. 

Such simple thought experiments show that assertions of symmetry in everyday physical objects are always a matter 
of approximate similarity rather than of precise mathematical sameness. The most important consequence of this 
approximate nature of symmetries in everyday physical objects is that such symmetries have minimal or no impacts 
on the physics of such objects. Consequently, only the deeper symmetries of space and time play a major role in 
classical physics — that is, the physics of large, everyday objects. 
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Quantum objects 

Remarkably, there exists a realm of physics for which mathematical assertions of simple symmetries in real objects 
cease to be approximations. That is the domain of quantum physics, which for the most part is the physics of very 
small, very simple objects such as electrons, protons, light, and atoms. 

Unlike everyday objects, objects such as electrons have very limited numbers of configurations, called states, in 
which they can exist. This means that when symmetry operations such as exchanging the positions of components 
are applied to them, the resulting new configurations often cannot be distinguished from the originals no matter how 
diligent an observer is. Consequently, for sufficiently small and simple objects the generic mathematical symmetry 
assertion F(x) = x ceases to be approximate, and instead becomes an experimentally precise and accurate description 
of the situation in the real world. 

Consequences of quantum symmetry 

While it makes sense that symmetries could become exact when applied to very simple objects, the immediate 
intuition is that such a detail should not affect the physics of such objects in any significant way. This is in part 
because it is very difficult to view the concept of exact similarity as physically meaningful. Our mental picture of 
such situations is invariably the same one we use for large objects: We picture objects or configurations that are 
very, very similar, but for which if we could "look closer" we would still be able to tell the difference. 

However, the assumption that exact symmetries in very small objects should not make any difference in their physics 
was discovered in the early 1900s to be spectacularly incorrect. The situation was succinctly summarized by Richard 
Feynman in the direct transcripts of his Feynman Lectures on Physics, Volume III, Section 3.4, Identical particles. 
(Unfortunately, the quote was edited out of the printed version of the same lecture.) 

"... if there is a physical situation in which it is impossible to tell which way it happened, it always interferes; it 
never fails." 

The word "interferes" in this context is a quick way of saying that such objects fall under the rules of quantum 
mechanics, in which they behave more like waves that interfere than like everyday large objects. 

In short, when an object becomes so simple that a symmetry assertion of the form F(x) = x becomes an exact 
statement of experimentally verifiable sameness, x ceases to follow the rules of classical physics and must instead be 
modeled using the more complex — and often far less intuitive — rules of quantum physics. 

This transition also provides an important insight into why the mathematics of symmetry are so deeply intertwined 
with those of quantum mechanics. When physical systems make the transition from symmetries that are approximate 
to ones that are exact, the mathematical expressions of those symmetries cease to be approximations and are 
transformed into precise definitions of the underlying nature of the objects. From that point on, the correlation of 
such objects to their mathematical descriptions becomes so close that it is difficult to separate the two. 

Generalizations of symmetry 

If we have a given set of objects with some structure, then it is possible for a symmetry to merely convert only one 
object into another, instead of acting upon all possible objects simultaneously. This requires a generalization from 
the concept of symmetry group to that of a groupoid. Indeed, A. Connes in his book v Non-commutative geometry' 
writes that Heisenberg discovered quantum mechanics by considering the groupoid of transitions of the hydrogen 
spectrum. 

The notion of groupoid also leads to notions of multiple groupoids, namely sets with many compatible groupoid 
structures, a structure which trivialises to abelian groups if one restricts to groups. This leads to prospects of "higher 
order symmetry' which have been a little explored, as follows. 

The automorphisms of a set, or a set with some structure, form a group, which models a homotopy 1-type. The 
automorphisms of a group G naturally form a crossed module G —> Aut(G), and crossed modules give an 
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algebraic model of homotopy 2-types. At the next stage, automorphisms of a crossed module fit into a structure 
known as a crossed square, and this structure is known to give an algebraic model of homotopy 3-types. It is not 
known how this procedure of generalising symmetry may be continued, although crossed n-cubes have been defined 

T71 T81 

and used in algebraic topology, and these structures are only slowly being brought into theoretical physics. 

Physicists have come up with other directions of generalization, such as supersymmetry and quantum groups, yet the 
different options are indistinguishable during various circumstances. 

Symmetry in chemistry 

Symmetry is important to chemistry because it explains observations in spectroscopy, quantum chemistry and 
crystallography. It draws heavily on group theory. 



Symmetry in history, religion, and culture 

In any human endeavor for which an impressive visual result is part of the desired objective, symmetries play a 
profound role. The innate appeal of symmetry can be found in our reactions to happening across highly symmetrical 
natural objects, such as precisely formed crystals or beautifully spiraled seashells. Our first reaction in finding such 
an object often is to wonder whether we have found an object created by a fellow human, followed quickly by 
surprise that the symmetries that caught our attention are derived from nature itself. In both reactions we give away 
our inclination to view symmetries both as beautiful and, in some fashion, informative of the world around us. 

Symmetry in religious symbols 

The tendency of people to see purpose in symmetry suggests at least 
one reason why symmetries are often an integral part of the symbols of 
world religions. Just a few of many examples include the sixfold 
rotational symmetry of Judaism's Star of David, the twofold point 
symmetry of Taoism's Taijitu or Yin- Yang, the bilateral symmetry of 
Christianity's cross and Sikhism's Khanda, or the fourfold point 
symmetry of Jain's ancient (and peacefully intended) version of the 
swastika. With its strong prohibitions against the use of 
representational images, Islam, and in particular the Sunni branch of 
Islam, has developed intricate use of symmetries. 

Symmetry in Social Interactions Symmetry in religious symbols. Row 1. 

Christian, Jewish, Taoism Row 2. Islamic, 

People observe the symmetrical nature, often including asymmetrical Buddhist, Shinto Row 3. Sikh, Baha'i, Jain 
balance, of social interactions in a variety of contexts. These include 

assessments of reciprocity, empathy, apology, dialog, respect, justice, and revenge. Symmetrical interactions send 
the message "we are all the same" while asymmetrical interactions send the message "I am special; better than you." 
Peer relationships are based on symmetry, power relationships are based on asymmetry 




Symmetry in architecture 

Another human endeavor in which the visual result plays a vital part in the overall result is architecture. Both in 
ancient times, the ability of a large structure to impress or even intimidate its viewers has often been a major part of 
its purpose, and the use of symmetry is an inescapable aspect of how to accomplish such goals. 

Just a few examples of ancient examples of architectures that made powerful use of symmetry to impress those 
around them included the Egyptian Pyramids, the Greek Parthenon, the first and second Temple of Jerusalem, 
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China's Forbidden City, Cambodia's Angkor Wat complex, and the many temples and pyramids of ancient 
Pre-Columbian civilizations. More recent historical examples of architectures emphasizing symmetries include 
Gothic architecture cathedrals, and American President Thomas Jefferson's Monticello home. India's unparalleled 
Taj Mahal is in a category by itself, as it may arguably be one of the most impressive and beautiful uses of symmetry 
in architecture that the world has ever seen. 



An interesting example of a broken symmetry in architecture is the Leaning 
Tower of Pisa, whose notoriety stems in no small part not for the intended 
symmetry of its design, but for the violation of that symmetry from the lean 
that developed while it was still under construction. Modern examples of 
architectures that make impressive or complex use of various symmetries 
include Australia's Sydney Opera House and Houston, Texas's simpler 
Astrodome. 

Symmetry finds its ways into architecture at every scale, from the overall 
external views, through the layout of the individual floor plans, and down 
to the design of individual building elements such as intricately caved 
doors, stained glass windows, tile mosaics, friezes, stairwells, stair rails, 
and balustrades s. For sheer complexity and sophistication in the 
exploitation of symmetry as an architectural element, Islamic buildings 
such as the Taj Mahal often eclipse those of other cultures and ages, due in 
part to the general prohibition of Islam against using images of people or 
animals J 10 ^ 




Leaning Tower of Pisa 



Symmetry in pottery and metal vessels 



Since the earliest uses of pottery wheels to help shape clay vessels, 
pottery has had a strong relationship to symmetry. As a minimum, 
pottery created using a wheel necessarily begins with full rotational 
symmetry in its cross-section, while allowing substantial freedom of 
shape in the vertical direction. Upon this inherently symmetrical 
starting point cultures from ancient times have tended to add further 
patterns that tend to exploit or in many cases reduce the original full 
rotational symmetry to a point where some specific visual objective is 
achieved. For example, Persian pottery dating from the fourth 
millennium B.C. and earlier used symmetric zigzags, squares, 
cross-hatchings, and repetitions of figures to produce more complex 
and visually striking overall designs. 

Cast metal vessels lacked the inherent rotational symmetry of 




wheel-made pottery, but otherwise provided a similar opportunity to Persian vessel (4th millennium B.C.) 

decorate their surfaces with patterns pleasing to those who used them. 

The ancient Chinese, for example, used symmetrical patterns in their bronze castings as early as the 17th century 
B.C. Bronze vessels exhibited both a bilateral main motif and a repetitive translated border design J 12 ^ ^ 13 ^ ^ 
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Symmetry in quilts 

As quilts are made from square blocks (usually 9, 16, or 25 pieces to a block) with 
each smaller piece usually consisting of fabric triangles, the craft lends itself readily 
to the application of symmetry J 15 ^ 




Kitchen Kaleidoscope Block 



Symmetry in carpets and rugs 

A long tradition of the use of symmetry in 
carpet and rug patterns spans a variety of 
cultures. American Navajo Indians used 
bold diagonals and rectangular motifs. Many 
Oriental rugs have intricate reflected centers 
and borders that translate a pattern. Not 
surprisingly, rectangular rugs typically use 
quadrilateral symmetry — that is, motifs that 
are reflected across both the horizontal and vertical axesJ 16 ^ ^ 




Symmetry in music 



Symmetry is of course not restricted to 
the visual arts. Its role in the history of 
music touches many aspects of the 
creation and perception of music. 

Musical form 

Symmetry has been used as a formal 
constraint by many composers, such as 
the arch (swell) form (ABCBA) used 
by Steve Reich, Bela Bartok, and 
James Tenney. In classical music, 
Bach used the symmetry concepts of 






permutation and in variance. 



[18] 



Major and minor triads on the white piano keys are symmetrical to the D. (compare 

article) (file) 



Pitch structures 

Symmetry is also an important consideration in the formation of scales and chords, traditional or tonal music being 
made up of non-symmetrical groups of pitches, such as the diatonic scale or the major chord. Symmetrical scales or 
chords, such as the whole tone scale, augmented chord, or diminished seventh chord (diminished-diminished 
seventh), are said to lack direction or a sense of forward motion, are ambiguous as to the key or tonal center, and 
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have a less specific diatonic functionality. However, composers such as Alban Berg, Bela Bartok, and George Perle 
have used axes of symmetry and/or interval cycles in an analogous way to keys or non-tonal tonal centers. 

Perle (1992) explains "C-E, D-F#, [and] Eb-G, are different instances of the same interval... the other kind of identity. 
..has to do with axes of symmetry. C-E belongs to a family of symmetrically related dyads as follows:" 

D D# E F F# G G# 
D C# C B A# A G# 

Thus in addition to being part of the interval-4 family, C-E is also a part of the sum-4 family (with C equal to 0). 

+ 2345 6 78 
2 1 0 11 10 9 8 
4 4 4 4 4 4 4 

Interval cycles are symmetrical and thus non-diatonic. However, a seven pitch segment of C5 (the cycle of fifths, 
which are enharmonic with the cycle of fourths) will produce the diatonic major scale. Cyclic tonal progressions in 
the works of Romantic composers such as Gustav Mahler and Richard Wagner form a link with the cyclic pitch 
successions in the atonal music of Modernists such as Bartok, Alexander Scriabin, Edgard Varese, and the Vienna 
school. At the same time, these progressions signal the end of tonality. 

The first extended composition consistently based on symmetrical pitch relations was probably Alban Berg's 
Quartet, Op. 3 (1910). (Perle, 1990) 

Equivalency 

Tone rows or pitch class sets which are invariant under retrograde are horizontally symmetrical, under inversion 
vertically. See also Asymmetric rhythm. 

Symmetry in other arts and crafts 

The concept of symmetry is applied to the design of objects of all 
shapes and sizes. Other examples include beadwork, furniture, sand 
paintings, knotwork, masks, musical instruments, and many other 
endeavors. 

Symmetry in aesthetics 

The relationship of symmetry to aesthetics is complex. Certain simple 
symmetries, and in particular bilateral symmetry, seem to be deeply 
ingrained in the inherent perception by humans of the likely health or fitness of other living creatures, as can be seen 
by the simple experiment of distorting one side of the image of an attractive face and asking viewers to rate the 
attractiveness of the resulting image. Consequently, such symmetries that mimic biology tend to have an innate 
appeal that in turn drives a powerful tendency to create artifacts with similar symmetry. One only needs to imagine 
the difficulty in trying to market a highly asymmetrical car or truck to general automotive buyers to understand the 
power of biologically inspired symmetries such as bilateral symmetry. 

Another more subtle appeal of symmetry is that of simplicity, which in turn has an implication of safety, security, 
and familiarity. A highly symmetrical room, for example, is unavoidably also a room in which anything out of place 
or potentially threatening can be identified easily and quickly. For example, people who have grown up in houses 
full of exact right angles and precisely identical artifacts can find their first experience in staying in a room with no 
exact right angles and no exactly identical artifacts to be highly disquieting. Symmetry thus can be a source of 
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comfort not only as an indicator of biological health, but also of a safe and well-understood living environment. 

Opposed to this is the tendency for excessive symmetry to be perceived as boring or uninteresting. Humans in 

particular have a powerful desire to exploit new opportunities or explore new possibilities, and an excessive degree 

of symmetry can convey a lack of such opportunities. Most people display a preference for figures that have a certain 

ri9i 

degree of simplicity and symmetry, but enough complexity to make them interesting. 1 

Yet another possibility is that when symmetries become too complex or too challenging, the human mind has a 
tendency to "tune them out" and perceive them in yet another fashion: as noise that conveys no useful information. 

Finally, perceptions and appreciation of symmetries are also dependent on cultural background. The far greater use 
of complex geometric symmetries in many Islamic cultures, for example, makes it more likely that people from such 
cultures will appreciate such art forms (or, conversely, to rebel against them). 

As in many human endeavors, the result of the confluence of many such factors is that effective use of symmetry in 
art and architecture is complex, intuitive, and highly dependent on the skills of the individuals who must weave and 
combine such factors within their own creative work. Along with texture, color, proportion, and other factors, 
symmetry is a powerful ingredient in any such synthesis; one only need to examine the Taj Mahal to powerful role 
that symmetry plays in determining the aesthetic appeal of an object. 

Modernist architecture rejects symmetry, stating only a bad architect relies on symmetry', instead of symmetrical 
layout of blocks, masses and structures, Modernist architecture relies on wings and balance of masses. This notion of 
getting rid of symmetry was first encountered in International style. Some people find asymmetrical layouts of 
buildings and structures revolutionizing; other find them restless, boring and unnatural. 

A few examples of the more explicit use of symmetries in art can be found in the remarkable art of M. C. Escher, the 
creative design of the mathematical concept of a wallpaper group, and the many applications (both mathematical and 
real world) of tiling. 

Notes 

[I] Penrose, Roger (2007). Fearful Symmetry. City: Princeton. ISBN 9780691134826. 

[2] For example, Aristotle ascribed spherical shape to the heavenly bodies, attributing this formally defined geometric measure of symmetry to 

the natural order and perfection of the cosmos. 
[3] Weyll982 

[4] For example, operations such as moving across a regularly patterned tile floor or rotating an eight-sided vase, or complex transformations of 

equations or in the way music is played. 
[5] See e.g., Mainzer, Klaus (2005). Symmetry And Complexity: The Spirit and Beauty of Nonlinear Science. World Scientific. 

ISBN 9812561927. 

[6] Symmetric objects can be material, such as a person, crystal, quilt, floor tiles, or molecule, or it can be an abstract structure such as a 

mathematical equation or a series of tones (music). 
[7] n-category cafe (http://golem.ph.utexas.edu/category/) - discussion of ^-groups 
[8] "Higher dimensional group theory' (http://www.bangor.ac.Uk/r.brown/hdaweb2.htm) 

[9] Emotional Competency (http://www.emotionalcompetency.com/symmetry.htm) Entry describing Symmetry 
[10] Williams: Symmetry in Architecture (http://members.tripod.com/vismath/kim/) 

[II] Aslaksen: Mathematics in Art and Architecture (http://www.math.nus.edu.sg/aslaksen/teaching/math-art-arch.shtml) 
[12] Chinavoc: The Art of Chinese Bronzes (http://www.chinavoc.com/arts/handicraft/bronze.htm) 

[13] Grant: Iranian Pottery in the Oriental Institute (http://www-oi.uchicago.edu/OI/MUS/VOL/NN_SUM94/NN_Sum94.html) 
[14] The Metropolitan Museum of Art - Islamic Art (http://www.metmuseum.org/collections/department.asp?dep=14) 
[15] Quate: Exploring Geometry Through Quilts (http://its.guilford.kl2.nc.us/webquests/quilts/quilts.htm) 
[16] Mallet: Tribal Oriental Rugs (http://www.marlamallett.com/default.htm) 
[17] Dilucchio: Navajo Rugs (http://navajocentral.org/rugs.htm) 

[18] see ("Fugue No. 21," pdf (http://jan.ucc.nau.edu/~tas3/wtc/ii21s.pdf) or Shockwave (http://jan.ucc.nau.edu/~tas3/wtc/ii21.html)) 
[19] Arnheim, Rudolf (1969). Visual Thinking. University of California Press. 
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External links 

• Official website (http://www.representations.org) 

Representations of the symmetric group 

In mathematics, the representation theory of the symmetric group is a particular case of the representation theory 
of finite groups, for which a concrete and detailed theory can be obtained. This has a large area of potential 
applications, from symmetric function theory to problems of quantum mechanics for a number of identical particles. 

The symmetric group has order nl. Its conjugacy classes are labeled by partitions of n. Therefore according to the 
representation theory of a finite group, the number of inequivalent irreducible representations, over the complex 
numbers, is equal to the number of partitions of n. Unlike the general situation for finite groups, there is in fact a 
natural way to parametrize irreducible representation by the same set that parametrizes conjugacy classes, namely by 
partitions of n or equivalently Young diagrams of size n. 

Each such irreducible representation can in fact be realized over the integers (every permutation acting by a matrix 
with integer coefficients); it can be explicitly constructed by computing the Young symmetrizers acting on a space 
generated by the Young tableaux of shape given by the Young diagram. 

Over other fields the situation can become much more complicated. If the field K has characteristic equal to zero or 
greater than n then by Maschke's theorem the group algebra KS is semisimple. In these cases the irreducible 
representations defined over the integers give the complete set of irreducible representations (after reduction modulo 
the characteristic if necessary). 

However, the irreducible representations of the symmetric group are not known in arbitrary characteristic. In this 
context it is more usual to use the language of modules rather than representations. The representation obtained from 
an irreducible representation defined over the integers by reducing modulo the characteristic will not in general be 
irreducible. The modules so constructed are called Specht modules, and every irreducible does arise inside some such 
module. There are now fewer irreducibles, and although they can be classified they are very poorly understood. For 
example, even their dimensions are not known in general. 

The determination of the irreducible modules for the symmetric group over an arbitrary field is widely regarded as 
one of the most important open problems in representation theory. 

Low dimensional representations 

The lowest dimensional representations of the symmetric groups can be described explicitly, as done in (Burnside 
1955, p. 468). This work was extended to the smallest k degrees (explicitly for k=4, and k=l) in (Rasala 1977), and 
over arbitrary fields in (James 1983). The smallest two degrees in characteristic zero are described here: 

Every symmetric group has a one-dimensional representation called the trivial representation, where every element 
acts as the one by one identity matrix. For n > 2, there is another irreducible representation of degree 1, called the 
sign representation or alternating character, which takes a permutation to the one by one matrix with entry ±1 
based on the sign of the permutation. These are the only one-dimensional representations of the symmetric groups, as 
one-dimensional representations are abelian, and the abelianization of the symmetric group is , the cyclic group 
of order 2. 

For all n, there is an /i-dimensional representation of the symmetric group of order n, called the permutation 
representation, which consists of permuting n coordinates. This has the trivial subrepresentation consisting of 
vectors whose coordinates are all equal. The orthogonal complement consists of those vectors whose coordinates 
sum to zero, and when n > 2, the representation on this subspace is an n — 1 dimensional irreducible representation. 
Another n-1 dimensional irreducible representation is found by tensoring with the sign representation. 
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For n > 7, these are the lowest-dimensional irreducible representations of S — all other irreducible representations 
have dimension at least n. However for n = 4, the surjection from to S allows to inherit a two-dimensional 
irreducible representation. For n = 6, the exceptional transitive embedding of into produces another pair of 
five-dimensional irreducible representations. 



Alternating group 



The representation theory of the alternating groups is similar, though 
the sign representation disappears. For n > 7, the lowest dimensional 
irreducible representations are the trivial representation in dimension 
one, and the (n — 1) -dimensional representation from the other 
summand of the permutation representation, with all other irreducible 
representations having higher dimension, but there are exceptions for 
smaller n. 




The compound of five tetrahedra, on which A§ 
acts, giving a 3 -dimensional representation. 



The alternating groups for n > 5 have only one one-dimensional irreducible representation, the trivial representation. 
For n = 3, 4 there are two additional one-dimensional irreducible representations, corresponding to maps to the cyclic 
group of order 3: A 3 = C 3 and A A -> AjV = C 3 . 

• For n > 7, there is just one irreducible representation of degree n-1, and this is the smallest degree of a 
non-trivial irreducible representation. 

• For n = 3 the obvious analogue of the (n - l)-dimensional representation is reducible - the permutation 
representation coincides with the regular representation, and thus breaks up into the three one-dimensional 
representations, as A% = C^is abelian; see the discrete Fourier transform for representation theory of cyclic 
groups. 

• For n = 4, there is just one n - 1 irreducible representation, but there are the exceptional irreducible 
representations of dimension 1 . 

• For n = 5, there are two dual irreducible representations of dimension 3, corresponding to its action as icosahedral 
symmetry. 

• For n = 6, there is an extra irreducible representation of dimension 5 corresponding to the exceptional transitive 
embedding of A in A 
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Representations of a finite group 

In mathematics, representation theory is a technique for analyzing abstract groups in terms of groups of linear 
transformations. See the article on group representations for an introduction. This article discusses the representation 
theory of groups that have a finite number of elements. 

Basic definitions 

All the linear representations in this article are finite dimensional and assumed to be complex unless otherwise 
stated. A representation of G is a group homomorphism p:G —> GL(n,C) from G to the general linear group 
GL(/2,C). Thus to specify a representation, we just assign a square matrix to each element of the group, in such a way 
that the matrices behave in the same way as the group elements when multiplied together. 

We say that p is a real representation of G if the matrices are real. In other words if p(G) C GL(n,R). 

Other formulations 

A representation p: G —> GL(n,C) defines a group action of G on the vector space C n . Moreover this action 
completely determines p. Hence to specify a representation it is enough to specify how it acts on its representing 
vector space. 

Alternatively, the action of a group G on a complex vector space V induces a left action of group algebra C[G] on the 
vector space V, and vice- versa. Hence representations are equivalent to left C[G] -modules. 

The group algebra C[G] is a I Gl -dimensional algebra over the complex numbers, on which G acts. (See Peter-Weyl 
for the case of compact groups.) In fact C[G] is a representation for GxG. More specifically, if g 1 and g 2 are 
elements of G and h is an element of C[G] corresponding to the element h of G, 

(8 r 8 2 )W=g l hg 2 l . 

C[G] can also be considered as a representation of G in three different ways: 

• Conjugation: g[h] = g h g~ l 

• As a left action: g[h] = g h (a regular representation) 
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• As a right action: g[h] = h g (also); 

these are all to be 'found' inside the GxG action. 



Example 



For many groups it is entirely natural to represent the group through matrices. Consider for example the dihedral 
group Z> 4 of symmetries of a square. This is generated by the two reflection matrices 



rn = 



n = 



-1 
0 

0 1 

1 0 



Here m is a reflection that maps (x,y) to (- x,y), while n maps (x,y) to (y,x). Multiplying these matrices together 
creates a set of 8 matrices that form the group. As discussed above, we can either think of the representation in terms 
of the matrices, or in terms of the action on the two-dimensional vector space (x,y). 

This representation is faithful - that is, there is a one-to-one correspondence between the matrices and the elements of 
the group. It is also irreducible, because there is no subspace of (x,y) that is invariant under the action of the group. 



Discrete Fourier transform 

If G is a finite cyclic group, then its representation theory is called the discrete Fourier transform; this example is 
central to digital signal processing. 

All irreducible representations are 1 -dimensional (characters), and correspond to sending a generator of G to a root 
of unity, not necessarily primitive (the trivial representation sends a generator to 1, for instance). 

A function on G is called the time domain representation of the function, while the corresponding expression in 
terms of characters is called the frequency domain representation of the function: changing from the time domain 
description to the frequency domain description is called the discrete Fourier transform, and the opposite direction is 
called the inverse discrete Fourier transform. 

The character table, which in this case is the matrix of the transform, is the DFT matrix, which is, up to 
normalization factor, the Vandermonde matrix for the ni\\ roots of unity; the order of rows and columns depends on a 
choice of generator and primitive root of unity. 

The group of characters is isomorphic to G itself, but not naturally so, and is known as the dual group, G : m me 
language of Pontryagin duality, and the original group G can be recovered as the double dual. 

Abelian groups 

More generally, any finite abelian group is a direct sum of finite cyclic groups (by the fundamental theorem of 
finitely generated abelian groups, though there is no canonical decomposition), and thus the representation theory of 
finite abelian groups is completely described by that of finite cyclic groups, that is, by the discrete Fourier transform. 

If an abelian group is expressed as a direct product, and the dual group likewise decomposed, and the elements of 
each sorted in lexicographic order, then the character table of the product group is the Kronecker product (tensor 
product) of the character tables for the two component groups, which is just a statement that the value of a product 
homomorphism on a product group is the product of the values: x er)(#, h) = p(g) • cr(h). 
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Morphisms between representations 

Given two representations p: G —> GL(^,C) and x: G —> GL(ra,C) a morphism between p and x is a linear map T : C n 
—> C m so that for all g in G we have the following commuting relation: T ° p(g) = x(g) ° T. 

According to Schur's lemma, a non-zero morphism between two irreducible complex representations is invertible, 
and moreover, is given in matrix form as a scalar multiple of the identity matrix. 

This result holds as the complex numbers are algebraically closed. For a counterexample over the real numbers, 
consider the two dimensional irreducible real representation of the cyclic group C^= (x) given by: 

" 0 1" 

p'- x ^[-i o - 

Then the matrix [ _?i q ] defines an automorphism of p, which is clearly not a scalar multiple of the identity matrix. 

Subrepresentations and irreducible representations 

As noted earlier, a representation p defines an action on a vector space C n . It may turn out that C n has an invariant 
subspace V C C n . The action of G is given by complex matrices and this in turn defines a new representation a : G —> 
GL(V). We call a a subrepresentation of p. A representation without subrepresentations is called irreducible. 

Constructing new representations from old 

There are number of ways to combine representations to obtain new representations. Each of these methods involves 
the application of a construction from linear algebra to representation theory. 

• Given two representations p 1? p 2 we may construct their direct sum p 1 © p 2 by (p 1 © p 2 ) (g)(v,w) = (p 1 te)v, 
P 2 (g)w)- 

• The tensor representation of p , p is defined by (p <E> p ) (v <E> w) = p (v) <E> p 2 (w). 

• Let p : G —> GL(n,C) be a representation. Then p induces a representation p on the dual of the vector space 
Hom(C n ,C). Let/: C n — » C be a linear functional. The representation p* is then defined by the rule p* (g) if) = 
f(p(g)~ l ). The representation p is called either the dual representation or the contragredient representation of p. 

• Furthermore, if a representation p has a subrepresentation a then the quotient of the representing vector spaces for 
p and a has a well defined action of G on it. We call the resulting representation the quotient representation of p 
by a. 

Young tableau 

For the symmetric groups, a graphical method exists to determine their finite representations that associates with 
each representation a Young tableau (also known as a Young diagram). The direct product of two representations 
may easily be decomposed into a direct sum of irreducible representation by a set of rules for the "direct product" of 
two Young diagrams. Each diagram also contains information about the dimension of the representation to which it 
corresponds. Young tableaux provide a far cleaner way of working with representations than the algebraic methods 
that underlie their use. 



Representations of a finite group 



345 



Applying Schur f s lemma 

Lemma. If f: A ® B — > C is a morphism of representations, then the corresponding linear transformation 
obtained by dualizing B is :f\A^C®B is also a morphism of representations. Similarly, if g\ A —> B 
®C is a morphism of representations, dualizing it will give another morphism of representations g'\ A ® 
C -^B. 

If p is an /i-dimensional irreducible representation of G with the underlying vector space V, then we can define a 
GxG morphism of representations, for all g in G and x in V 

f: C[G] ® (1 (8) V) -> (V ® 1 ) 

/i(g®x) = p(g)M 

where 1^ is the trivial representation of G. This defines a GxG morphism of representations. 
Now we use the above lemma and obtain the GxG morphism of representations 

/' : V g> V -> C[G] • 

The dual representation of C[G] as a GxG-representation is equivalent to C[G]. An isomorphism is given if we 
define the contraction (g,h) = 5 . So, we end up with a GxG-morphism of representations 

Then 

f"(x®y) = Y;(x,p(9)[y])9 

for all x in J/ and y in V. 

By Schur's lemma, the image of /' is a GxG irreducible representation, which is therefore nxn dimensional, which 
also happens to be a subrepresentation of C[G] if' is nonzero). 

This is n direct sum equivalent copies V. Note that if p 1 and p 2 are equivalent G-irreducible representations, the 
respective images of the intertwining matrices would give rise to the same Gx G-irreducible representation of C[G]. 

Here, we use the fact that iff is a function over G, then 

gEG geG 

We convert C[G] into a Hilbert space by introducing the norm where (g,h) is 1 if g is h and zero otherwise. This 
is different from the 'contraction' given a couple of paragraphs back, in that this form is sesquilinear. This makes 
C[G] a unitary representation of GxG. In particular, we now have the concepts of orthogonal complement and 
orthogonality of subrepresentations. 

In particular, if C[G] contains two inequivalent irreducible GxG subrepresentations, then both subrepresentations are 
orthogonal to each other. To see this, note that for every subspace of a Hilbert space, there exists a unique linear 
transformation from the Hilbert space to itself which maps points on the subspace to itself while mapping points on 
its orthogonal complement to zero. This is called the projection map. The projection map associated with the first 
irreducible representation is an intertwiner. Restricted to the second irreducible representation, it gives an intertwiner 
from the second irreducible representation to the first. Using Schur's lemma, this must be zero. 

Now suppose A ® B is a Gx G-irreducible representation of C[G]. 

Note. The complex irreducible representations of GxH are always a direct product of a complex 
irreducible representation of G and a complex irreducible representation of H. This is not the case for 
real irreducible representations. As an example, there is a 2 dimensional real irreducible representation 
of the group x which transforms nontrivially under both copies of but cannot be expressed as 
the direct product of two irreducible representations of C . 
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This representation is also a G-representation (n direct sum copies of B where n is the dimension of A). If Y is an 
element of this representation (and hence also of C[G]) and X an element of its dual representation (which is a 
subrepresentation of the dual representation of C[G]), then 

f"{X®Y)= Y:(X,p(9)[Y])g= Y:(^Yg-')g 

gEG geG 

= Y:(X,g- 1 )gY =J2(X,g-i){g,e)[Y] 

geG geG 

where e is the identity of G. Though the /' defined a couple of paragraphs back is only defined for G-irreducible 
representations, and though A ® B is not a G-irreducible representation in general, we claim this argument could be 
made correct since A (8) B is simply the direct sum of copies of Bs, and we have shown that each copy all maps to the 
same Gx G-irreducible subrepresentation of C[G], we have just showed that ft gj Q as an irreducible 
GxG-subrepresentation of C[G] is contained in A ® B as another irreducible GxG-subrepresentation of C[G]. Using 
Schur's lemma again, this means both irreducible representations are the same. 
Putting all of this together, 

Theorem. C[G] = V ® G ^ where the sum is taken over the inequivalent G-irreducible 
V 

representations V. 

Corollary. If there are p inequivalent G-irreducible representations, V., each of dimension n., then IGI = 

2 , , 2 * 1 1 

+... + n . 
1 P 

Character theory 

Main article: Character theory 

There is a mapping from G to the complex numbers for each representation called the character given by the trace of 
the linear transformation upon the representation generated by the element of G in question 

X p (g)=Tr[p(g)]. 

All elements of G belonging to the same conjugacy class have the same character: in other words x p is a class 
function on G. This follows from 

Tr[p(ghg 1 )]=Tr[p(g)p(h)p(g)- 1 ]=Tr[p(h)] 

by the cyclic property of the trace of a matrix. 

What are the characters of C[G]? Using the property that gh _1 is only the same as g if h = e, „,(g) is IGI if g=e and 
0 otherwise. 

The character of a direct sum of representations is simply the sum of their individual characters. 

Putting all of this together, 
p 

^rkXvM = \G\6 ge 
i=i 

with the Kronecker delta on the right hand side. 

Repeat this, working with characters of GxG instead of characters, of G which I'll call A. Then, A (g,h) is the 
number of elements k in G satisfying g k h~ = k. This is equal to 
VP p 

i=l i=l i=l 

where * denotes complex conjugation. After all, C[G] is a unitary representation and any subrepresentation of a 
finite unitary representation is another unitary representation; and all irreducible representations are (equivalent to) a 
subrepresentation of C[G]. 
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Consider 
heG 

2 

This is IGI times the number of elements which commute with g\ which is IGI divided by the size of the conjugacy 
class of g, if g and k belong to the same conjugacy class, but zero otherwise. Therefore, for each conjugacy class C. 
of size m., the characters are the same for each element of the conjugacy class and so we can just call by an 

abuse of notation). Then, 

i G , P 



M fc=i 



Note that 

5>(s) 

geG 

is a self-intertwiner (or invariant). This linear transformation, when applied to C[G] (as a representation of the 
second copy of GxG), would give as its image the 1 -dimensional subrepresentation generated by 

EP; 

geG 

which is obviously the trivial representation. 

Since we know C[G] contains all irreducible representations up to equivalence and using Schur's lemma, we 
conclude that 

£/>(<?) 

geG 

for irreducible representations is zero if it's not the trivial irreducible representation; and it's of course IGI1 if the 
irreducible representation is trivial. 

Given two irreducible representations V. and V., we can construct a G-representation 

this time not as a GxG representation but an ordinary G-representation. See direct product of representations. Then, 

Xv&Vfig) = XvA^XvM = XvX9Txv k {g)- 
It can be shown that any irreducible representation can be turned into a unitary irreducible representation. So, the 
direct product of two irreducible representations can also be turned into a unitary representations and now, we have 
the neat orthogonality property allowing us to decompose the direct product into a direct sum of irreducible 
representations (we're also using the property that for finite dimensional representations, if you keep taking proper 
subrepresentations, you'll hit an irreducible representation eventually. There's no infinite strictly decreasing sequence 
of positive integers). See Maschke's theorem. 

If then this decomposition does not contain the trivial representation (Otherwise, we'd have a nonzero intertwiner 
from V. to V. contradicting Schur's lemma). If i=j, then it contains exactly one copy of the trivial representation 
(Schur's lemma states that if A and B are two intertwiners from V. to itself, since they're both multiples of the 
identity, A and B are linearly dependent). Therefore, 

£ xvXgTxvAa) = £ l^lx^)**^) = !<%■■ 

gEG k 

Applying a result of linear algebra to both orthogonality relations (I C.I is always positive), we find that the number of 
conjugacy classes is greater than or equal to the number of inequivalent irreducible representations; and also at the 
same time less than or equal to. The conclusion, then, is that the number of conjugacy classes of G is the same as the 
number of inequivalent irreducible representations of G. 

Corollary. If two representations have the same characters, then they are equivalent. 
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Proof. Characters can be thought of as elements of a q-dimensional vector space where q is the number 
of conjugacy classes. Using the orthogonality relations derived above, we find that the q characters for 
the q inequivalent irreducible representations forms a basis set. Also, according to Maschke's theorem, 
both representations can be expressed as the direct sum of irreducible representations. Since the 
character of the direct sum of representations is the sum of their characters, from linear algebra, we see 
they are equivalent. 

We know that any irreducible representation can be turned into a unitary representation. It turns out the Hilbert space 
norm is unique up to multiplication by a positive number. To see this, note that the conjugate representation of the 
irreducible representation is equivalent to the dual irreducible representation with the Hilbert space norm acting as 
the intertwiner. Using Schur's lemma, all possible Hilbert space norms can only be a multiple of each other. 

Let p be an irreducible representation of a finite group G on a vector space V of (finite) dimension n with character %. 
It is a fact that %(g) = n if and only if p(g) = id (see for instance Exercise 6.7 from Serre's book below). A 
consequence of this is that if x is a non-trivial irreducible character of G such that %(g) = x(l) for some g±l then G 
contains a proper non-trivial normal subgroup (the normal subgroup is the kernel of p). Conversely, if G contains a 
proper non-trivial normal subgroup N, then the composition of the natural surjective group homomorphism G —> GIN 
with the regular representation of GIN produces a representation jt of G which has kernel N. Taking x to be the 
character of some non-trivial subrepresentation of jt, we have a character satisfying the hypothesis in the direct 
statement above. Altogether, whether or not G is simple can be determined immediately by looking at the character 
table of G. 

History 

The general features of the representation theory of a finite group G, over the complex numbers, were discovered by 
Ferdinand Georg Frobenius in the years before 1900. Later the modular representation theory of Richard Brauer was 
developed. 

Generalizations 

The Peter-Weyl theorem extends many results about representations of finite groups to representations of compact 
groups. 

References 

• Fulton, William; Harris, Joe (1991), Representation theory. A first course, Graduate Texts in Mathematics, 
Readings in Mathematics, 129, New York: Springer- Verlag, MR1 153249, ISBN 978-0-387-97527-6, 
ISBN 978-0-387-97495-8 

The standard graduate level reference for representations of groups in general, particularly Lie groups. 

• James, Gordon; and Liebeck, Martin (1993). Representations and Characters of Finite Groups. Cambridge: 
Cambridge University Press. ISBN 0-521-44590-6. 

A beautiful and readable introduction; designed for self study. 

• Serre, Jean-Pierre (1977). Linear Representations of Finite Groups. Springer- Verlag. ISBN 0-387-90190-6. 

A very well-written introduction to stated topic: concise and extremely readable. 



Representations of a finite group 



349 



External links 

• character of a finite group (http://planetmath.org/encyclopedia/Character2.html) at PlanetMath. 

Representations of finite groups of Lie type 

In mathematics, Deligne-Lusztig theory is a way of constructing linear representations of finite groups of Lie type 
using £-adic cohomology with compact support, introduced by Deligne & Lusztig (1976). 

Lusztig (1984) used these representations to find all representations of all finite simple groups of Lie type. 

Motivation 

Suppose that G is a reductive group defined over a finite field, with Frobenius map F. 

Macdonald conjectured that there should be a map from general position characters of F-stable maximal tori to 
irreducible representations of G (the fixed points of F). For general linear groups this was already known by the 
work of Green (1955). This was the main result proved by Deligne and Lusztig; they found a virtual representation 
for all characters of an F-stable maximal torus, which is irreducible (up to sign) when the character is in general 
position. 

When the maximal torus is split, these representations were well known and are given by parabolic induction of 
characters of the torus (extend the character to a Borel subgroup, then induce it up to G). The representations of 
parabolic induction can be constructed using functions on a space, which can be thought of as elements of a suitable 
zeroth cohomology group. Deligne and Lusztig's construction is a generalization of parabolic induction to non-split 
tori using higher cohomology groups. (Parabolic induction can also be done with tori of G replaced by Levi 
subgroups of G, and there is a generalization of Deligne-Lusztig theory to this case too.) 

Drinfel'd proved that the discrete series representations of SL 2 (F ) can be found in the ^-adic cohomology groups 

of the affine curve X defined by 
xy q -yx q = 1. 

The construction of Deligne and Lusztig is a generalization of this fundamental example to other groups. The affine 
curve X is generalized to a T F bundle over a "Deligne-Lusztig variety" where T is a maximal torus of G, and instead 
of using just the first cohomology group they use an alternating sum of £-adic cohomology groups with compact 
support to construct virtual representations. 

The Deligne-Lusztig construction is formally similar to Weyl's construction of the representations of a compact 
group from the characters of a maximal torus. The case of compact groups is easier partly because there is only one 
conjugacy class of maximal tori. The Borel-Weil-Bott construction of representations of algebraic groups using 
coherent sheaf cohomology is also similar. 

For real semisimple groups there is an analogue of the construction of Deligne and Lusztig, using Zuckerman 
functors to construct representations. 



Representations of finite groups of Lie type 



350 



Deligne-Lusztig varieties 

The construction of Deligne-Lusztig characters uses a family of auxiliary algebraic varieties X T called 
Deligne-Lusztig varieties, constructed from a reductive linear algebraic group G defined over a finite field F^. 

If B is a Borel subgroup of G and T a maximal torus of B then we write 

W 

T,B 

for the Weyl group (normalizer mod centralizer) 
N G (T)IT 

of G with respect to T, together with the simple roots corresponding to B. If B^ is another Borel subgroup with 
maximal torus T then there is a canonical isomorphism from Tto T that identifies the two Weyl groups. So we can 
identify all these Weyl groups, and call it 'the' Weyl group W of G. Similarly there is a canonical isomorphism 
between any two maximal tori with given choice of positive roots, so we can identify all these and call it 'the' 
maximal torus T of G. 

By the Bruhat decomposition 

G = BWB, 

the subgroup B. can be written as the conjugate of B by bw for some bGB and wG W (identified with W_ _) where w is 

1 * 1 ,B 

uniquely determined. In this case we say that B and B are in relative position w. 

Suppose that w is in the Weyl group of G, and write X for the smooth projective variety of all Borel subgroups of G. 
The Deligne-Lusztig variety X(w) consists of all Borel subgroups B of G such that B and F(B) are in relative 
position w. In other words it is the inverse image of the G-homogeneous space of pairs of Borel subgroups in relative 
position w, under the Lang isogeny with formula 

For example, if w=l then X(w) is 0-dimensional and its points are the rational Borel subgroups of G. 

We let T(w) be the torus T, with the rational structure for which the Frobenius is wF. The G conjugacy classes of 
F- stable maximal tori of G can be identified with the F-conjugacy classes of W, where we say wEWis ^-conjugate to 
elements of the form vwF(v) -1 for v€W. If the group G is split, so that F acts trivially on W, this is the same as 
ordinary conjugacy, but in general for non-split groups G, F may act on W via a non-trivial diagram automorphism. 
The F-stable conjugacy classes can be identified with elements of the non-abelian galois cohomology group of 
torsors 

H l (F,W). 

Fix a maximal torus T of G and a Borel subgroup B containing it, both invariant under the Frobenius map F, and 
write U for the unipotent radical of B. If we choose a representative w' of the normalizer N(T) representing w, then 
we define X'(w') to be the elements of G/U with F(u)=uw'. This is acted on freely by F(F), and the quotient is 
isomorphic to X(T). So for each character 0 of T(w) we get a corresponding local system F_ on X(w). The 
Deligne-Lusztig virtual representation 

R d (w) 

of G is defined by the alternating sum 

B°(w) = j:{-iyHi(X(w),Fe) 

i 

of /-adic compactly supported cohomology groups of X(w) with coefficients in the /-adic local system F Q . 

If T is a maximal F-invariant torus of G contained in a Borel subgroup B such that B and FB are in relative position 

6 6 6 

w then R (w) is also denoted by R ^ D , or by R as up to isomorphism it does not depend on the choice of B. 
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Properties of Deligne-Lusztig characters 

• The character of R ^does not depend on the choice of prime l±p, and if 9=1 its values are rational integers. 

F 6 

• Every irreducible character of G occurs in at least one character R (w). 

6 6' F 

• The inner product of R ^and R is equal to the number of elements of W(T,T) taking 9 to 9'. The set W(T,T') 

is the set of elements of G taking T to T under conjugation, modulo the group 7^ which acts on it in the obvious 
way (so if T-T it is the Weyl group). In particular the inner product is 9 if w and w' are not /^-conjugate. If 9 is in 
general position then R ^has norm 1 and is therefore an irreducible character up to sign. So this verifies 
Macdonald's conjecture. 

6 

• The representation R ^contains the trivial representation if and only if 9=1 (in which case the trivial 

representation occurs exactly once). 
0 

• The representation R T has dimension 

<W*J0 = 

F F F 

where U is a Sylow /^-subgroup of G , of order the largest power of p dividing IG I. 

6 

• The restriction of the character R ^to unipotent elements u does not depend on 9 and is called a Green function, 
denoted by Q_ (u) (the Green function is defined to be 9 on elements that are not unipotent). The character 
formula gives the character ofR T 'm terms of Green functions of subgroups as follows: 

where x=su is the Jordan-Chevalley decomposition of x as the product of commuting semisimple and 
unipotent elements s and u, and G is the identity component of the centralizer of s in G. In particular the 

S F 

character value vanishes unless the semisimple part of x is conjugate under G to something in the torus T. 

• The Deligne-Lusztig variety is usually affine, in particular whenever the characteristic p is larger than the Coxeter 
number h of the Weyl group. If it is affine and the character 9 is in general position (so that the Deligne-Lusztig 
character is irreducible up to sign) then only one of the cohomology groups H l (X(w),F^) is non-zero (the one with 
i equal to the length of w) so this cohomology group gives a model for the irreducible representation. In general it 
is possible for more than one cohomology group to be non-zero, for example when 9 is 1. 

Lusztig f s classification of irreducible characters 

Lusztig classified all the irreducible characters of G by decomposing such a character into a semisimple character 
and a unipotent character (of another group) and separately classifying the semisimple and unipotent characters. 

The dual group 

The representations of G are classified using conjugacy classes of the dual group of G. A reductive group over a 
finite field determines a root datum (with choice of Weyl chamber) together with an action of the Frobenius element 
on it. The dual group G of a reductive algebraic group G defined over a finite field is the one with dual root datum 
(and adjoint Frobenius action). This is similar to the Langlands dual group (or L-group), except here the dual group 
is defined over a finite field rather than over the complex numbers. The dual group has the same root system, except 
that root systems of type B and C get exchanged. 

The local Langlands conjectures state (very roughly) that representations of an algebraic group over a local field 
should be closely related to conjugacy classes in the Langlands dual group. Lusztig's classification of representations 
of reductive groups over finite fields can be thought of as a verification of an analogue of this conjecture for finite 
fields (though Langlands never stated his conjecture for this case). 
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Jordan decomposition 

In this section G will be a reductive group with connected center. 

An irreducible character is called unipotent if it occurs in some R 1 ^ and is called semisimple if its average value on 
regular unipotent elements is non-zero (in which case the average value is 1 or -1). If p is a good prime for G 
(meaning that it does not divide any of the coefficients of roots expressed as linear combinations of simple roots) 
then an irreducible character is semisimple if and only if its order is not divisible by p. 

An arbitrary irreducible character has a "Jordan decompostion": to it one can associate a semisimple character 
(corresponding to some semisimple element s of the dual group), and a unipotent representation of the centralizer of 
s. The dimension of the irreducible character is the product of the dimensions of its semisimple and unipotent 
components. 

This (more or less) reduces the classification of irreducible characters to the problem of finding the semisimple and 
the unipotent characters. 

Geometric conjugacy 

Two pairs (7,9), (7', 9') of a maximal torus 7 of G fixed by F and a character 9 of 7^ are called geometrically 

conjugate if they are conjugate under some element of G(k), where k is the algebraic closure of F .If an irreducible 

6 6' 

representation occurs in both R T and R then (7,9), (7', 9') need not be conjugate under G , but are always 
geometrically conjugate. For example if 9 = 9' = 1 and 7 and T are not conjugate, then the identity representation 
occurs in both Deligne-Lusztig characters, and the corresponding pairs (7,1), (7',1) are geometrically conjugate but 
not conjugate. 

The geometric conjugacy classes of pairs (7,9) are parameterized by geometric conjugacy classes of semisimple 
elements s of the group G of elements of the dual group G fixed by F. Two elements of G are called 
geometrically conjugate if they are conjugate over the algebraic closure of the finite field; if the center of G is 
connected this is equivalent to conjugacy in G . The number of geometric conjugacy classes of pairs (7,9) is IZ°V 
where Z° is the identity component of the center Z of G and / is the semisimple rank of G. 

Classification of semisimple characters 

In this subsection G will be a reductive group with connected center Z. (The case when the center is not connected 
has some extra complications.) 

The semisimple characters of G correspond to geometric conjugacy classes of pairs (7,9) (where 7 is a maximal 
torus invariant under F and 9 is a character of 7^); in fact among the irreducible characters occurring in the 
Deligne-Lusztig characters of a geometric conjugacy class there is exactly one semisimple character. If the center of 

F I 

G is connected there are IZ \q semisimple characters. If k is a geometric conjugacy class of pairs (7,9) then the 
character of the corresponding semisimple representation is given up to sign by 

(T,0)G«, mod G F {Rt'Rt) 

and its dimension is the p' part of the index of the centralizer of the element s of the dual group corresponding to it. 

The semisimple characters are (up to sign) exactly the duals of the regular characters, under a certain duality 
operation on generalized characters introduced by Curtis and Kawanaka. An irreducible character is called regular if 
it occurs in the Gelfand-Graev representation G , which is the representation induced from a certain 
"non-degenerate" 1 -dimensional character of the Sylow /^-subgroup. It is reducible, and any irreducible character of 
G occurs at most once in it. If k is a geometric conjugacy class of pairs (7,9) then the character of the 
corresponding regular representation is given by 
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(T,0)€«, mod G F ^^ R T) 

and its dimension is the // part of the index of the centralizer of the element s of the dual group corresponding to it 
times the /?-part of the order of the centralizer. 



Classification of unipotent characters 

These can be found from the cuspidal unipotent characters: those that cannot be obtained from decomposition of 
parabolically induced characters of smaller rank groups. The unipotent cuspidal characters were listed by Lusztig 
using rather complicated arguments. The number of them depends only on the type of the group and not on the 
underlying field; for example, there are none for groups of type A , at most 1 for all classical groups, and 13 for 
groups of type E The unipotent characters can be found by decomposing the characters induced from the cuspidal 

o 

ones, using results of Howlett and Lehrer. The number of unipotent characters depends only on the root system of 
the group and not on the field (or the center). The dimension of the unipotent characters can be given by universal 
polynomials in the order of the ground field depending only on the root system; for example the Steinberg 
representation has dimension q r , where r is the number of positive roots of the root system. 

Lusztig discovered that the unipotent characters of a group G (with irreducible Weyl group) fall into families of size 
4 n (n>0), 8, 21, or 39. The characters of each family are indexed by conjugacy classes of pairs (x,o) where x is in one 
of the groups Z/2Z n , S , respectively, and a is a representation of its centralizer. (The family of size 39 only 

occurs for groups of type E^, and the family of size 21 only occurs for groups of type F^) The families are in turn 
indexed by special representations of the Weyl group, or equivalently by 2-sided cells of the Weyl group. For 
example, the group E (F ) has 46 families of unipotent characters corresponding to the 46 special representations of 

5 q 

the Weyl group of E Q . There are 23 families with 1 character, 18 families with 4 characters, 4 families with 8 

o 

characters, and one family with 39 characters (which includes the 13 cuspidal unipotent characters). 



Examples 

Suppose that q is an odd prime power, and G is the algebraic group SE^. We describe the Deligne-Lusztig 
representations of the group SL (F ). (The representation theory of these groups was well known long before 

2 q 

Deligne-Lusztig theory.) 

The irreducible representations are: 

• The trivial representations of dimension 1. 

• The Steinberg representation of dimension q 

• The (q-3)/2 irreducible principal series representations of dimension g+1, together with 2 representations of 
dimension (g+l)/2 coming from a reducible principal series representation. 

• The (q-l)/2 irreducible discrete series representations of dimension q-1, together with 2 representations of 
dimension (q-l)/2 coming from a reducible discrete series representation. 

There are two classes of tori associated to the two elements (or conjugacy classes) of the Weyl group, denoted by 
T(l) (cyclic of order q-1) and T(w) (cyclic of order q+1). The non-trivial element of the Weyl group acts on the 
characters of these tori by changing each character to its inverse. So the Weyl group fixes a character if and only if it 
has order 1 or 2. By the orthogonality formula, R (w) is (up to sign) irreducible if 9 does not have order 1 or 2, and a 
sum of two irreducible representations if it has order 1 or 2. 

The Deligne-Lusztig variety X(l) for the split torus is O-dimensional with q+\ points, and can be identified with the 
points of 1 -dimensional projective space defined over F^. The representations R (1) are given as follows: 

• 1+Steinberg if 9=1 

• The sum of the 2 representations of dimension (g+l)/2 if 9 has order 2. 

• An irreducible principal series representation if 9 has order greater than 2. 
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The Deligne-Lusztig variety X(w) for the non-split torus is 1 -dimensional, and can be identified with the complenent 
of X(l) in 1 -dimensional projective space. So it is the set of points (x:y) of projective space not fixed by the 
Frobenius map (x:y)—> (x q :y q ), in other words the points with 

xy q -yx q ^0 
Drinf eld's variety of points (x ,y) of affine space with 
xy q — yx q = 1 

maps to X(w) in the obvious way, and is acted on freely by the group of g+lth roots X of 1 (which can be identified 

with the elements of the non-split torus that are defined over F ), with X taking (x,y) to (kx^y). The Deligne Lusztig 

q 6 

variety is the quotient of Drinf eld's variety by this group action. The representations -R (w) are given as follows: 

• Steinberg- 1 if 9=1 

• The sum of the 2 representations of dimension (q-l)/2 if 9 has order 2. 

• An irreducible discrete series representation if 9 has order greater than 2. 

The unipotent representations are the trivial representation and the Steinberg representation, and the semisimple 
representations are all the representations other than the Steinberg representation. (In this case the semisimple 
representations do not correspond exactly to geometric conjugacy classes of the dual group, as the center of G is not 
connected.) 

Intersection cohomology and character sheaves 

Lusztig (1985) replaced the £-adic cohomology used to define the Deligne-Lusztig representations with intersection 
£-adic cohomology, and introduced certain perverse sheaves called character sheaves. The representations defined 
using intersection cohomology are related to those defined using ordinary cohomology by Kazhdan-Lusztig 
polynomials. The F-invariant irreducible character sheaves are closely related to the irreducible characters of the 
group G . 

References 

• Carter, Roger W. (2006), "A survey of the work of George Lusztig" ^\ Nagoya Mathematical Journal 182: 1-45. 

• Carter, Roger W. (1993), Finite Groups of Lie Type: Conjugacy Classes and Complex Characters, Wiley Classics 
Library, Chichester: Wiley, ISBN 0471941093 

• Roger W. Carter (2001), "Deligne-Lusztig characters" , in Hazewinkel, Michiel, Encyclopaedia of 
Mathematics, Springer, ISBN 978-1556080104 

• Carter, Roger W. (1995), "On the representation theory of the finite groups of Lie type over an algebraically 
closed field of characteristic 0", Algebra IX, Encyclopaedia Math. Sci., 77, Berlin: Springer, pp. 1-120, 235-239,, 
MR1 170353, ISBN 3540570381 

• Digne, Francois; Michel, Jean (1991), Representations of Finite Groups of Lie Type, London Mathematical 
Society Student Texts, Cambridge: Cambridge University Press, ISBN 052140648X 

• Deligne, P.; Lusztig, G. (1976), "Representations of reductive groups over finite fields." , Ann. Of Math. (2) 
(The Annals of Mathematics, Vol. 103, No. 1) 103 (1): 103-161, doi:10.2307/1971021. MR0393266 

Ml 

• Green, J. A. (1955), "The characters of the finite general linear group" , Trans. A. M. S. (Transactions of the 
American Mathematical Society, Vol. 80, No. 2) 80 (2): 402-447, doi: 10.2307/1992997. 

• Lusztig, George (1984), Characters of Reductive Groups over a Finite Field, Annals of Mathematics Studies, 
Princeton, N.J.: Princeton University Press, ISBN 0691083509 

• Lusztig, G. (1984), "Intersection cohomology complexes on a reductive group", Invent. Math. 75: 205-272, 
doi:10.1007/BF01388564 

• Lusztig, G. (1987), Introduction to character sheaves, Proc. Symp. Pure Math., 47, Amer. Math. Soc, 
pp. 165-179 



Representations of finite groups of Lie type 



355 



• Lusztig, G. (1991), "Intersection cohomology methods in representation theory", Proc. Internat. Congr. Math. 
Kyoto 1990, /, Springer, pp. 155-174 

• Lusztig, G. (1985), "Character sheaves I", Adv. Math. 56: 193-237, doi:http://www. sciencedirect.com/science/ 
article/B6W9F-4CRY4BS-HC/2/4a5603bab245a32731782ff952d2edl6, Adv. Math. , 57 (1985) 226-265 [5] ; 
Adv. Math. , 57 (1985)266-315 [6] ; Adv. Math. , 59 (1986) 1-63 [7] ; Adv. Math. , 61 (1986) 103-155 [8] 

• Srinivasan, Bhama (1979), Representations of finite Chevalley groups. A survey., Lecture Notes in Mathematics, 
764, Berlin-New York: Springer, ISBN 3-540-09716-3 MR0551499 

References 

[1] http://projecteuclid.org/Dienst/UI/l .O/Summarize/euclid.nmj/l 150810003 
[2] http ://eom. springer. de/D/d 1 20 1 00 . htm 

[3] http://linksjstor.org/sici?sici=0003-486X%28197601%292%3A103%3Al%3C103%3ARORGOF%3E2.0.CO 
[4] http://links.jstor.org/sici?sici=0002-9947%28195511%2980%3A2%3C402%3ATCOTFG%3E2.0.C^ 
[5] http://www.sciencedirectx 
[6] http://www.sciencedirectx 

[7] http://www.sciencedirectxom/science/article/B6W9F-4CRYlXB-9N/2/b72a6cdffa52d88f661d7c 
[8] http://www.sciencedirectxo 

Representations of Lie groups 

In mathematics and theoretical physics, the idea of a representation of a Lie group plays an important role in the 
study of continuous symmetry. A great deal is known about such representations, a basic tool in their study being the 
use of the corresponding 'infinitesimal' representations of Lie algebras. The physics literature sometimes passes over 
the distinction between Lie group representations and Lie algebra representations. 

Representations on a complex finite-dimensional vector space 

Let us first discuss representations acting on finite-dimensional complex vector spaces. A representation of a Lie 
group G on a finite-dimensional complex vector space Vis a smooth group homomorphism *P:G^Aut(V) from G to 
the automorphism group of V. 

For ^-dimensional V, the automorphism group of Vis identified with a subset of complex square-matrices of order n. 
The automorphism group of V is given the structure of a smooth manifold using this identification. The condition 
that *P is smooth, in the definition above, means that ^ is a smooth map from the smooth manifold G to the smooth 
manifold Aut(V). 

If a basis for the complex vector space V is chosen, the representation can be expressed as a homomorphism into 
GL(/i,C). This is known as a matrix representation. 

Representations on a finite-dimensional vector space over an arbitrary field 

A representation of a Lie group G on a vector space V (over a field K) is a smooth (i.e. respecting the differential 
structure) group homomorphism G^Aut(V) from G to the automorphism group of V. If a basis for the vector space 
V is chosen, the representation can be expressed as a homomorphism into GL(n,K). This is known as a matrix 
representation. Two representations of G on vector spaces V, W are equivalent if they have the same matrix 
representations with respect to some choices of bases for V and W. 

On the Lie algebra level, there is a corresponding linear mapping from the Lie algebra of G to End(V) preserving the 
Lie bracket [ , ]. See representation of Lie algebras for the Lie algebra theory. 

If the homomorphism is in fact a monomorphism, the representation is said to be faithful. 
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A unitary representation is defined in the same way, except that G maps to unitary matrices; the Lie algebra will then 
map to skew-hermitian matrices. 

If G is a compact Lie group, every finite-dimensional representation is equivalent to a unitary one. 



Representations on Hilbert spaces 

A representation of a Lie group G on a complex Hilbert space V is a group homomorphism *¥:G —> B(V) from G to 
B(V), the group of bounded linear operators of V which have a bounded inverse, such that the map GxV —> V given 
by (g,v) — » *P(g)v is continuous. 

This definition can handle representations on infinite-dimensional Hilbert spaces. Such representations can be 
found in e.g. quantum mechanics, but also in Fourier analysis as shown in the following example. 

Let G=R, and let the complex Hilbert space V be L 2 (R). We define the representation *P:R B(L 2 (R)) by 
W(r){fix)} ^f{r- l x). 

See also Wigner's classification for representations of the Poincare group. 



Classification 

If G is a semisimple group, its finite-dimensional representations can be decomposed as direct sums of irreducible 
representations. The irreducibles are indexed by highest weight; the allowable {dominant) highest weights satisfy a 
suitable positivity condition. In particular, there exists a set of fundamental weights, indexed by the vertices of the 
Dynkin diagram of G, such that dominant weights are simply non-negative integer linear combinations of the 
fundamental weights. The characters of the irreducible representations are given by the Weyl character formula. 

If G is a commutative Lie group, then its irreducible representations are simply the continuous characters of G: see 
Pontryagin duality for this case. 

A quotient representation is a quotient module of the group ring. 



Formulaic examples 



Let be a finite field of order q and characteristic p. Let G be a finite group of Lie type, that is, G is the F^-rational 
points of a connected reductive group G defined over F . For example, if n is a positive integer GL(rc, F ) and SL(n, 
F^) are finite groups of Lie type. Let J = [ _ ^ ] , where 1^ is the nxn identity matrix. Let 

Sp 2 (F q ) = {g G GL^tFJfgJg = J) . 
Then Sp(2,F^) is a symplectic group of rank n and is a finite group of Lie type. For G = GL(n, F ) or SL(n, F ) (and 
some other examples), the standard Borel subgroup B of G is the subgroup of G consisting of the upper triangular 
elements in G. A standard parabolic subgroup of G is a subgroup of G which contains the standard Borel subgroup 
B. If P is a standard parabolic subgroup of GL(?z, F^), then there exists a partition (n^ n^) of n (a set of positive 
integers % such that ni + . . . + n r = n) such that P = P(ni i ...,n r ) = M X N , where 
M^GL ni (¥ q )x 



X GL nr (F q ) has the form 



M = 



0 



o 

A 2 



0\ 
0 



Aj G GL nj (F g ), 1 < j <r 



and 
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r/j, 



N = 



i 



711 

0 



*\1 

* 



V 



> , 



o ■■■ o I nr/ j 

where * denotes arbitrary entries in ¥ q . 
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Representations of Lie algebras 

In the mathematical field of representation theory, a Lie algebra representation or representation of a Lie algebra 

is a way of writing a Lie algebra as a set of matrices (or endomorphisms of a vector space) in such a way that the Lie 
bracket is given by the commutator. 

The notion is closely related to that of a representation of a Lie group. Roughly speaking, the representations of Lie 
algebras are the differentiated form of representations of Lie groups, while the representations of the universal cover 
of a Lie group are the integrated form of the representations of its Lie algebra. 



Formal definition 

A representation of a Lie algebra 0is a Lie algebra homomorphism 

from 0to the Lie algebra of endomorphisms on a vector space V^with the commutator as the Lie bracket), sending 
an element x of 0to an element of 01(1^) . 
Explicitly, this means that 

P[x,y] = \Px,Py] = PxPy ~ PyPx 
for all 2/ in 0. The vector space y, together with the representation p, is called a 0 -module. (Many authors 
abuse terminology and refer to V itself as the representation). 

One can equivalently define a 0 -module as a vector space ^together with a bilinear map q X V — > l^such that 

[x, y] ■ v = x ■ (y • v) - y ■ (x « v) 
for all x^y'm 0and v in V • This is related to the previous definition by setting x • v = p x [v). 
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Infinitesimal Lie group representations 

If (j) : G — > H is a homomorphism of Lie groups, and 0and f)are the Lie algebras of Q and H respectively, 
then the induced map 0* : g — > f) on tangent spaces is a Lie algebra homomorphism. In particular, a representation 
of Lie groups 

0 : G -> GL(V) 
determines a Lie algebra homomorphism 

<t>* ■ B -» 0 W 

from 0to the Lie algebra of the general linear group GL(V) , i.e. the endomorphism algebra of V- 
A partial converse to this statement says that every representation of a finite-dimensional (real or complex) Lie 
algebra lifts to a unique representation of the associated simply connected Lie group, so that representations of 
simply-connected Lie groups are in one-to-one correspondence with representations of their Lie algebras. 

Properties 

Representations of a Lie algebra are in one-to-one correspondence with algebra representations of the associated 
universal enveloping algebra. This follows from the universal property of that construction. 

If the Lie algebra is semisimple, then all reducible representations are decomposable. Otherwise, that's not true in 
general. 

If we have two representations, with and V 2 as their underlying vector spaces and [ ] 1 and -[-] 2 as the 
representations, then the product of both representations would have V\ ® V^as the underlying vector space and 

x[vi <8> v 2 ] = x[vi] <g> v 2 + vi ® x[v 2 ]. 
If L is a real Lie algebra and p : L X V — > Vis a complex representation of it, we can construct another 
representation of L called its dual representation as follows. 

Let V* be the dual vector space of V. In other words, V* is the set of all linear maps from V to C with addition 
defined over it in the usual linear way, but scalar multiplication defined over it such that = zu;[X]for 

any z in C, oo in V* and X in V. This is usually rewritten as a contraction with a sesquilinear form (•, ). i.e. (co,X) is 
defined to be 00 [X]. 
We define pas follows: 

< p(A)[a>LX> + <a>, PA[X]) = 0, 
for any A in L, 00 in V* and X in V. This defines p uniquely. 

Classification 

Finite-dimensional representations of semisimple Lie algebras 

Similarly to how semisimple Lie algebras can be classified, the finite-dimensional representations of semisimple Lie 
algebras can be classified. This is a classical theory, widely regarded as beautiful, and a standard reference is (Fulton 
& Harris 1992). 

Briefly, finite-dimensional representations of a semisimple Lie algebra are completely reducible, so it suffices to 
classify irreducible (simple) representations. Semisimple Lie algebras are classified in terms of the weights of the 
adjoint representation, the so-called root system; in a similar manner all finite-dimensional irreducible 
representations can be understood in terms of weights; see weight (representation theory) for details. 



Representations of Lie algebras 



359 



Representation on an algebra 

If we have a Lie superalgebra L, then a representation of L on an algebra is a (not necessarily associative) Z 2 graded 
algebra A which is a representation of L as a Z 2 graded vector space and in addition, the elements of L acts as 
deri vations/antideri vations . 

More specifically, if H is a pure element of L and x and y are pure elements of A, 

H[xy] = (H[x])y + (-l) xH x(H[y]) 
Also, if A is unital, then 

H[1] = 0 

Now, for the case of a representation of a Lie algebra, we simply drop all the gradings and the (-1) to the some 
power factors. 

A Lie (super)algebra is an algebra and it has an adjoint representation of itself. This is a representation on an algebra: 
the (anti)derivation property is the superJacobi identity. 

If a vector space is both an associative algebra and a Lie algebra and the adjoint representation of the Lie algebra on 
itself is a representation on an algebra (i.e., acts by derivations on the associative algebra structure), then it is a 
Poisson algebra. The analogous observation for Lie superalgebras gives the notion of a Poisson superalgebra. 
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Representations of the Poincare group 

In mathematics, the representation theory of the Poincare group is an example of the representation theory of a 
Lie group that is neither a compact group nor a semisimple group. It is fundamental in theoretical physics. 

In a physical theory having Minkowski space as the underlying spacetime, the space of physical states is typically a 
representation of the Poincare group. (More generally, it may be a projective representation, which amounts to a 
representation of the double cover of the group.) 

In a classical field theory, the physical states are sections of a Poincare-equivariant vector bundle over Minkowski 
space. The equivariance condition means that the group acts on the total space of the vector bundle, and the 
projection to Minkowski space is an equi variant map. Therefore the Poincare group also acts on the space of 
sections. Representations arising in this way (and their subquotients) are called covariant field representations, and 
are not usually unitary. 

For a discussion of such unitary representations, see Wigner's classification. 

In quantum mechanics, the state of the system is determined by the Schrodinger equation, which is only invariant 
under Galilean transformations. Quantum field theory is the relativistic extension of quantum mechanics, where 
relativistic (Lorentz/Poincare invariant] wave equations are solved, "quantized", and act on a Hilbert space composed 
of Fock states; eigenstates of the theory's Hamiltonian which are states with a definite number of particles with 
individual 4-momentum. There are no finite unitary representations of the full Lorentz (and thus Poincare) 
transformations due to the nature of Lorentz boosts (rotations in Minkowski space along a space and time axis. 
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Representation theory of the Lorentz group 

The Lorentz group of theoretical physics has a variety of representations, corresponding to particles with integer and 
half-integer spins in quantum field theory. These representations are normally constructed out of spinors. 

The group may also be represented in terms of a set of functions defined on the Riemann sphere, these are the 
Riemann P-functions, which are expressible as hypergeometric functions. An important special case is the subgroup 
SO(3), where these reduce to the spherical harmonics, and find practical application in the theory of atomic spectra. 

The Lorentz group has no unitary representation of finite dimension, except for the trivial representation (where 
every group element is represented by 1). 

Finding representations 

According to general representation theory of Lie groups, one first looks for the representations of the 
complexification of the Lie algebra of the Lorentz group. A convenient basis for the Lie algebra of the Lorentz group 

is given by the three generators of rotations /W^L.. and the three generators of boosts K*=L. where i, j, and k run 

i] it 

over the three spatial coordinates and 8 is the Levi-Civita symbol for a three dimensional spatial slice of Minkowski 
space. Note that the three generators of rotations transform like components of a pseudo vector J and the three 
generators of boosts transform like components of a vector K under the adjoint action of the spatial rotation 
subgroup. 

This motivates the following construction: first complexify, and then change basis to the components of A = (J + i 
K)/2 and B = (J — / K)/2. In this basis, one checks that the components of A and B satisfy separately the 
commutation relations of the Lie algebra sl 2 and moreover that they commute with each other. In other words, one 
has the isomorphism 

50(3,1) ®C=*[ 2 (C)©5l 2 (C). 

The utility of this isomorphism comes from the fact that sl 2 is the complexification of the rotation algebra, and so its 
irreducible representations correspond to the well-known representations of the spatial rotation group; for each j in 
ViTj, one has the (2/+l)-dimensional spin-j representation spanned by the spherical harmonics with j as the highest 
weight. Thus the finite dimensional irreducible representations of the Lorentz group are simply given by an ordered 
pair of half-integers (m,n) which fix representations of the subalgebra spanned by the components of A and B 
respectively. 

Properties of the (m,n) irrep 

Since the angular momentum operator is given by J = A + B, the highest weight of the rotation subrepresentation 
will be m+n. So for example, the (1/2,1/2) representation has spin 1. The (m,n) representation is 
(2m+ 1 )(2n+ 1 )-dimensional . 

Common representations 

• (0,0) the Lorentz scalar representation. This representation is carried by relativistic scalar field theories. 

• (1/2,0) is the left-handed Weyl spinor and (0,1/2) is the right-handed Weyl spinor representation. 

• (1/2,0) © (0,1/2) is the bispinor representation (see also Dirac spinor). 

• (1/2,1/2) is the four- vector representation. The electromagnetic vector potential lives in this rep. It is a 1-form 
field. 

• (1,0) is the self-dual 2-form field representation and (0,1) is the anti-self-dual 2-form field representation. 

• (1,0) © (0,1) is the representation of a parity invariant 2-form field. The electromagnetic field tensor transforms 
under this representation. 
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• (1,1/2) © (1/2,1) is the Rarita-Schwinger field representation. 

• (1,1) is the spin-2 representation of the traceless metric tensor. 

Full Lorentz group 

The (m,n) representation is irreducible under the restricted Lorentz group (the identity component of the Lorentz 
group). When one considers the full Lorentz group, which is generated by the restricted Lorentz group along with 
time and parity reversal, not only is this not an irreducible representation, it is not a representation at all, unless m=n. 
The reason is that this representation is formed in terms of the sum of a vector and a pseudovector, and a parity 
reversal changes the sign of one, but not the other. The upshot is that a vector in the (m,n) representation is carried 
into the (n,m) representation by a parity reversal. Thus (m,n)®(n,m) gives an irrep of the full Lorentz group. When 
constructing theories such as QED which is invariant under parity reversal, Dirac spinors may be used, while 
theories that do not, such as the electroweak force, must be formulated in terms of Weyl spinors. 

Infinite dimensional unitary representations 

History 

The Lorentz group SO(3,l) and its double cover SL(2,C) also have infinite dimensional unitary representations, first 
studied independently by Bargmann (1947), Gelfand & Naimark (1947) and Harish-Chandra (1947) (at the 
instigation of Paul Dirac). The Plancherel formula for these groups was first obtained by Gelfand and Naimark 
through involved calculations. The treatment was subsequently considerably simplified by Harish-Chandra (1951) 
and Gelfand & Graev (1953), based on an analogue for SL(2,C) of the integration formula of Hermann Weyl for 
compact Lie groups. Elementary accounts of this approach can be found in Riihl (1970) and Knapp (2001). 

The theory of spherical functions for the Lorentz group, required for harmonic analysis on the 3-dimensional 
hyperboloid in Minkowski space, or equivalently 3-dimensional hyperbolic space, is considerably easier than the 
general theory. It only involves representations from the spherical principal series and can be treated directly, 
because in radial coordinates the Laplacian on the hyperboloid is equivalent to the Laplacian on R. This theory is 
discussed in Takahashi (1963), Helgason (1968), Helgason (2000) and the posthumous text of Jorgenson & Lang 
(2008). 

Principal series 

The principal series, or unitary principal series, are the unitary representations induced from the one-dimensional 
representations of the lower triangular subgroup B of G = SL(2,C). Since the one-dimensional representations of B 
correspond to the representations of the diagonal matrices, with non-zero complex entries z and and thus have 
the form 

for k an integer and v real. The representations are irreducible; the only repetitions occur when k is replaced by —k. 

2 2 
By definition the representations are realised on L sections of line bundles on G I B = S , the Riemann sphere. 

When k - 0, these representations constitute the so-called spherical principal series. 

The restriction of a principal series to the maximal compact subgroup K = SU(2) of G can also be realised as an 
induced representation of K using the identification G I B = K I T, where T = B Pi K is the maximal torus in K 
consisting of diagonal matrices with lzl=l. It is the representation induced from the 1 — dimensional representation z 
7, and is independent of v. By Frobenius reciprocity, on K they decompose as a direct sum of the irreducible 
representations of K with dimensions \k\ + 2m +1 with m a non-negative integer. 

Using the identification between the Riemann sphere minus a point and C, the principal series can be defined 

2 

directly on L (C) by the formula 
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Irreducibility can be checked in a variety of ways: 

• The representation is already irreducible on B. This can be seen directly, but is also a special case of general 
results on ireducibility of induced representations due to Francois Bruhat and George Mackey, relying on the 

Bruhat decomposition G = B U B s B where s is the Weyl group element r 1 l l j 



c -.*> i 



• The action of the Lie algebra 0of G can be computed on the algebraic direct sum of the irreducible subspaces of 
K can be computed explicitly and the it can be verified directly that the lowest dimensional subspace generates 
this direct sum as a 0 -module. 

Complementary series 

2 

The for 0 < t < 2, the complementary series is defined on L functions /on C for the inner product 



(>•»>=// 



f(z)g(w) dz dw 



with the action given by 



C 3" 



*C "A M = \cz + d\- 2 - t f 



az + fe N 



cz + d J 

The complementary series are irreducible and inequivalent. As a representation of K, each is isomorphic to the 

Hilbert space direct sum of all the odd dimensional irreducible representations of K = SU(2). Irreducibility can be 

proved by analysing the action of 0on the algebraic sum of these subspaces or directly without using the Lie 
i u [4] [5] 

algebra. 

Plancherel theorem 

The only irreducible unitary representations of SL(2,C) are the principal series, the complementary series and the 

k 

trivial representation. Since -/ acts ( -1) on the principal series and trivially on the remainder, these will give all the 

irreducible unitary representations of the Lorentz group, provided k is taken to be even. 

2 

To decompose the left regular representation of G on L (G), only the principal series are required. This immediately 

2 

yields the decomposition on the subrepresentations L (G/ ±1), the left regular representation of the Lorentz group, 

2 

and L (G / K), the regular representation on 3-dimensional hyperbolic space. (The former only involves principal 

series representations with k even and the latter only those with k = 0.) 

2 

The left and right regular representation X and p are defined on L (G) by 

%)/(*) = /GT 1 *), P(9)f(x) = f(xg). 

Now if /is an element of C c (G), the operator k (f) defined by 

*v,fc(/) = / f(g)n{g) dg 

Jg 

is Hilbert-Schmidt. We define a Hilbert space H by 

H = 0 HS{L\C)) ® L 2 (i?, c k {v 2 + k 2 ) l / 2 dv), 

k>0 

where 

c 0 = 1/47T 3 / 2 , c k = 1/(2tt) 3 / 2 (k ^ 0) 

2 2 T61 

and HS(L (C)) denotes the Hilbert space of Hilbert-Schmidt operators on L (C). Then the map U defined on 

C c (G)by 

U(f)(u,k)=7T l/ , k {f) 
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2 

extends to a unitary of L (G) onto H. 



The map U satisfies 

U(\(x)p(y)f)(u, k) = ^uM'^M^Av)' 

If/ r / 2 arein C (G) then 

/oc 
Tr(7r I/ife (/i)7r I/;fc (/ 2 )*)(^ 2 + fc 2 ) dz/. 
-OO 



fc>0 



Thus if f=f 1 * f* denotes the convolution of and/ 2 *, where f£(g) = f2{g~ 1 ) > men 

/oo 
Tr(^(/))(^ + fc 2 )^. 

The last two displayed formulas are usually referred to as the Plancherel formula and the Fourier inversion 

formula respectively. The Plancherel formula extends to allf in L 2 (G). By a theorem of Jacques Dixmier and Paul 

Malliavin, every function fin C^°(G) is a finite sum of convolutions of similar functions, the inversion formula 

T71 

holds for such/. It can be extended to much wider classes of functions satisfying mild differentiability conditions. 1 J 

See also 

• Poincare group 

• Wigner's classification 



Notes 

[1] Knapp 2001, Chapter II 

[2] Harish-Chandra 1947 

[3] Taylor 1986 

[4] Gelf and & Naimark 1947 

[5] Takahashi 1963, p. 343 

[6] Note that for a Hilbert space H, HS(H) may be identified canonically with the Hilbert space tensor product of H and its conjugate space. 

[7] Knapp 2001 
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Double group 

In crystallography, the space group (or crystallographic group, or Fedorov group) of a crystal is a description of 
the symmetry of the crystal, and can have one of 230 types. In mathematics space groups are also studied in 
dimensions other than 3 where they are sometimes called Bieberbach groups, and are discrete cocompact groups of 
isometries of an oriented Euclidean space. 

A definitive source regarding 3-dimensional space groups is the International Tables for Crystallography (Hahn 
(2002)). 

History 

The space groups in 3 dimensions were first enumerated by Fyodorov (1891), and shortly afterwards were 
independently enumerated by Barlow (1894) and Schonflies (1891). These first enumerations all contained several 
minor mistakes, and the correct list of 230 space groups was found during correspondence between Fyodorov and 
Schonflies. 

Space groups in 2 dimensions are the 17 wallpaper groups which have been known for several centuries. 

Elements of a space group 

The space groups in three dimensions are made from combinations of the 32 crystallographic point groups with the 
14 Bravais lattices which belong to one of 7 lattice systems. This results in a space group being some combination of 
the translational symmetry of a unit cell including lattice centering, the point group symmetry operations of 
reflection, rotation and improper rotation (also called rotoin version), and the screw axis and glide plane symmetry 
operations. The combination of all these symmetry operations results in a total of 230 unique space groups 
describing all possible crystal symmetries. 
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Elements fixing a point 

The elements of the space group fixing a point of space are rotations, reflections, the identity element, and improper 
rotations. 

Translations 

The translations form a normal abelian subgroup of rank 3, called the Bravais lattice. There are 14 possible types of 
Bravais lattice. The quotient of the space group by the Bravais lattice is a finite group which is one of the 32 possible 
point groups. 

Glide planes 

A glide plane is a reflection in a plane, followed by a translation parallel with that plane. This is noted by a, b or c, 
depending on which axis the glide is along. There is also the n glide, which is a glide along the half of a diagonal of a 
face, and the d glide, which is a fourth of the way along either a face or space diagonal of the unit cell. The latter is 
called the diamond glide plane as it features in the diamond structure. 

Screw axes 

A screw axis is a rotation about an axis, followed by a translation along the direction of the axis. These are noted by 
a number, n, to describe the degree of rotation, where the number is how many operations must be applied to 
complete a full rotation (e.g., 3 would mean a rotation one third of the way around the axis each time). The degree of 
translation is then added as a subscript showing how far along the axis the translation is, as a portion of the parallel 
lattice vector. So, 2 is a twofold rotation followed by a translation of 1/2 of the lattice vector. 

Notation for space groups 

There are at least eight methods of naming space groups. Some of these methods can assign several different names 
to the same space group, so altogether there are many thousands of different names. 

• Number. The International Union of Crystallography publishes tables of all space group types, and assigns each a 
unique number from 1 to 230. The numbering is arbitrary, except that groups with the same crystal system or 
point group are given consecutive numbers. 

• International symbol or Hermann-Mauguin notation. The Hermann-Mauguin (or international) notation 
describes the lattice and some generators for the group. It has a shortened form called the international short 
symbol, which is the one most commonly used in crystallography, and usually consists of a set of four symbols. 
The first describes the centering of the Bravais lattice (P, A,B, C, I, R or F). The next three describe the most 
prominent symmetry operation visible when projected along one of the high symmetry directions of the crystal. 
These symbols are the same as used in point groups, with the addition of glide planes and screw axis, described 
above. By way of example, the space group of quartz is P3 1 21, showing that it exhibits primitive centering of the 
motif (i.e., once per unit cell), with a threefold screw axis and a twofold rotation axis. Note that it does not 
explicitly contain the crystal system, although this is unique to each space group (in the case of 7^21, it is 
trigonal). 

In the international short symbol the first symbol (3 1 in this example) denotes the symmetry along the major 
axis (c-axis in trigonal cases), the second (2 in this case) along axes of secondary importance (a and b) and the 
third symbol the symmetry in another direction. In the trigonal case there also exists a space group P3 1 12. In 
this space group the twofold axes are not along the a and b-axes but in a direction rotated by 30°. 

The international symbols and international short symbols for some of the space groups were changed slightly 
between 1935 and 2002, so several space groups have 4 different international symbols in use. 
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• Hall notation. Space group notation with an explicit origin. Rotation, translation and axis-direction symbols are 
clearly separated and inversion centers are explicitly defined. The construction and format of the notation make it 
particularly suited to computer generation of symmetry information. For example, group number 3 has three Hall 
symbols: P 2y (P 1 2 1), P 2 (P 1 1 2), P 2x (P 2 1 1). 

• Schonflies notation. The space groups with given point group are numbered by 1, 2, 3, ... (in the same order as 

their international number) and this number is added as a superscript to the Schonflies symbol for the point group. 

12 3 

For example, groups numbers 3 to 5 whose point group is have Schonflies symbols C 2 , C 2 , C 2< 

• Shubnikov symbol 

• 2D:Orbifold notation and 3D:Fibrifold notation. As the name suggests, the orbifold notation describes the 
orbifold, given by the quotient of Euclidean space by the space group, rather than generators of the space group. It 
was introduced by Conway and Thurston, and is not used much outside mathematics. Some of the space groups 
have several different fibrifolds associated to them, so have several different fibrifold symbols. 



Classification systems for space groups 

There are (at least) 10 different ways to classify space groups into classes. The relations between some of these are 
described in the following table. Each classification system is a refinement of the ones below it. 

(Crystallographic) space group types (230 in three dimensions). Two space groups, considered as subgroups of the group of affine 
transformations of space, have the same space group type if they are conjugate by an orientation-preserving affine transformation. In three 
dimensions, for 1 1 of the affine space groups, there is no orientation-preserving map from the group to its mirror image, so if one distinguishes 
groups from their mirror images these each split into two cases. So there are 54+11=65 space group types that preserve orientation. 



Affine space group types (219 in three dimensions). Two space groups, considered as subgroups of the group of affine transformations of space, 
have the same affine space group type if they are conjugate under an affine transformation. The affine space group type is determined by the 
underlying abstract group of the space group. In three dimensions there are 54 affine space group types that preserve orientation. 

Arithmetic crystal classes (73 in three dimensions). These are determined by the point group together with the action of the point group on the 
subgroup of translations. In other words the arithmetic crystal classes correspond to conjugacy classes of finite subgroup of the general linear group 
GL n (Z) over the integers. A space group is called symmorphic (or split) if there is a point such that all symmetries are the product of asymmetry 
fixing this point and a translation. Equivalently, a space group is symmorphic if it is a semidirect product of its point group with its translation 
subgroup. There are 73 symmorphic space groups, with exactly one in each arithmetic crystal class. There are also 157 nonsymmorphic space group 
types with varying numbers in the arithmetic crystal classes. 



(geometric) Crystal classes (32 in three dimensions). The crystal class of a space Bravais flocks (14 in three dimensions). These are determined 

group is determined by its point group: the quotient by the subgroup of by the underlying Bravais lattice type. These correspond to 

translations, acting on the lattice. Two space groups are in the same crystal class conjugacy classes of lattice point groups in GL 2 (Z), where the 

if and only if their point groups, which are subgroups of GL 2 (Z), are conjugate in lattice point group is the group of symmetries of the underlying 

the larger group GL 2 (Q). lattice that fix a point of the lattice, and contains the point 

group. 

Crystal systems. (7 in three dimensions) Crystal systems are an ad hoc Lattice systems (7 in three dimensions). The lattice system of a 

modification of the lattice systems to make them compatible with the space group is determined by the conjugacy class of the lattice 

classification according to point groups. They differ from crystal families in that point group (a subgroup of GL 2 (Z)) in the larger group GL 2 (Q). 

the hexagonal crystal family is split into two subsets, called the trigonal and In three dimensions the lattice point group can have one of the 7 

hexagonal crystal systems. The trigonal crystal system is larger than the different orders 2, 4, 8, 12, 16, 24, or 48. The hexagonal crystal 

rhombohedral lattice system, the hexagonal crystal system is smaller than the family is split into two subsets, called the rhombohedral and 

hexagonal lattice system, and the remaining crystal systems and lattice systems hexagonal lattice systems, 
are the same. 



Crystal families (6 in three dimensions). The point group of a space group does not quite determine its lattice system, because occasionally two 
space groups with the same point group may be in different lattice systems. Crystal families are formed from lattice systems by merging the two 
lattice systems whenever this happens, so that the crystal family of a space group is determined by either its lattice system or its point group. In 3 
dimensions the only two lattice families that get merged in this way are the hexagonal and rhombohedral lattice systems, which are combined into 
the hexagonal crystal family. The 6 crystal families in 3 dimensions are called triclinic, monoclinic, orthorhombal, tetragonal, hexagonal, and cubic. 
Crystal families are commonly used in popular books on crystals, where they are sometimes called crystal systems. 
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Conway, Delgado Friedrichs, and Huson et al. (2001) gave another classification of the space groups, according to 
the fibrifold structures on the corresponding orbifold. They divided the 219 affine space groups into reducible and 
irreducible groups. The reducible groups fall into 17 classes corresponding to the 17 wallpaper groups, and the 
remaining 35 irreducible groups are the same as the cubic groups and are classified separately. 

Space groups in other dimensions 
Bieberbach's theorems 

In n dimensions, an affine space group, or Bieberbach group, is a discrete subgroup of isometries of /i-dimensional 
Euclidean space with a compact fundamental domain. Bieberbach (1911, 1912) proved that the subgroup of 
translations of any such group contains n linearly independent translations, and is a free abelian subgroup of finite 
index, and is also the unique maximal normal abelian subgroup. He also showed that in any dimension n there are 
only a finite number of possibilities for the isomorphism class of the underlying group of a space group, and 
moreover the action of the group on Euclidean space is unique up to conjugation by affine transformations. This 
answers part of Hilbert's 18th problem. Zassenhaus (1948) showed that conversely any group that is the extension of 
ll 1 by a finite group acting faithfully is an affine space group. Combining these results shows that classifying space 
groups in n dimensions up to conjugation by affine transformations is essentially the same as classifying 
isomorphism classes for groups that are extensions of IT by a finite group acting faithfully. 

It is essential in Bieberbach's theorems to assume that the group acts as isometries; the theorems do not generalize to 
discrete cocompact groups of affine transformations of Euclidean space. A counter-example is given by the 
3 -dimensional Heisenberg group of the integers acting by translations on the Heisenberg group of the reals, 
identified with 3-dimensional Euclidean space. This is a discrete cocompact group of affine transformations of space, 
but does not contain a subgroup Z . 



Classification in small dimensions 

This table give the number of space group types in small dimensions. 



Dimension 


Lattice types 
(sequence 

A004030 [1] 
in OEIS) 


point groups 
(sequence 

A004028 1 1 
in OEIS) 


Crystallographic 
space group types 
(sequence A006227 [3] 
in OEIS) 


Affine space 

group types 

(sequence 
T41 

A004029 L J in 
OEIS) 


Classification 


0 


1 


1 


1 


1 


Trivial group 


1 


1 


2 


2 


2 


One is the group of integers and the other is the infinite 
dihedral group ;see symmetry groups in one dimension 


2 


5 


10 


17 


17 


these 2D space groups are also called wallpaper groups 
or plane groups. 


3 


14 


32 


230 


219 


In 3D there are 230 crystallographic space group types, 
which reduces to 219 affine space group types because of 
some types being different from their mirror image; these 

are said to differ by "enantiomorphous character" (e.g. 

P3 1 12 and P3 2 12). Usually "space group" refers to 3D. 
They were enumerated independently by Barlow (1894), 
Fedorov (1891) and Schonflies (1891). 


4 


64 


227 


4895 


4783 


The 4895 4-dimensional groups were enumerated by 
Harold Brown, Rolf Bulow, and Joachim Neubtiser et 
al. (1978). 
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5 


189 


955 




222018 


Plesken & Schulz (2000) enumerated the ones of 
dimension 5 


6 




7104 


28934974 


28927922 


Plesken & Schulz (2000) enumerated the ones of 
dimension 6 



Double groups and time reversal 

In addition to crystallographic space groups there are also magnetic space groups or double groups. These 
symmetries contain an element known as time reversal. They are of importance in magnetic structures that contain 
ordered unpaired spins, i.e. ferro-, ferri- or antiferromagnetic structures as studied by neutron diffraction. The time 
reversal element flips a magnetic spin while leaving all other structure the same and it can be combined with a 
number of other symmetry elements. Including time reversal there are 1651 magnetic space groups in 3D (Kim 1999, 
p.428). 



Table of space groups in 3 dimensions 





Point group 


# 






Hermann-Mauguin 


Schonflies 






Triclinic (2) 


1 




1 


PI 




1 


c. 


2 


PI 


Monoclinic 
(13) 


2 




3-5 


P2, P2 r C2 


m 


c 

s 


6-9 


Pm, Pc, Cm, Cc 




2/m 


C 2h 


10-15 


P2/m, P2 i /m, C2/m, P2/c, P2 i /c, C2/c 


Orthorhombic 

(59) 


222 


D 2 


16-24 


P222, P222 , P2 2 2, P2 2 2 C222 , C222, F222, 1222, 12 2 2 


mm2 


C 2v 


25-46 


Pmm2, Pmc2 , Pcc2, Pma2, Pca2 , Pnc2, Pmn2 , Pba2, Pna2 , Pnn2, Cmm2, 
Cmc2 , Ccc2, Amm2, Aem2, Ama2, Aea2, Fmm2, Fdd2, Imm2, Iba2, Ima2 




mmm 


D 2h 


47-74 


Pmmm, Pnnn, Pccm, Pban, Pmma, Pnna, Pmna, Pcca, Pbam, Pccn, Pbcm, Pnnm, 
Pmmn, Pbcn, Pbca, Pnma, Cmcm, Cmce, Cmmm, Cccm, Cmme, Ccce, Fmmm, 
Fddd, Immm, Ibam, Ibca, Imma 


Tetragonal (68) 


4 


C 4 


75-80 


P4, P4 , P4 , P4 , 14, 14 

r 2 31 




4 


S 4 


81-82 


P4, 14 




4/m 


C 4h 


83-88 


P4/m, P4 /m, P4/n, P4 /n, I4/m, 14 /a 

2 2 1 




422 


D 4 


89-98 


P422, P42 2, P4 22, P4 2 2, P4 22, P4 2 2, P4 22, P4 2 2, 1422, 14 22 

1 1 11 2 21 3 31 1 




4mm 


C 4v 


99-110 


P4mm, P4bm, P4 cm, P4 nm, P4cc, P4nc, P4 mc, P4 be, Mmm, 14cm, 14 md, 14 cd 

2 2 2 2 11 




42m 


D 2d 


111-122 


P42m, P42c, P42 x m, P42 c, P4m2, P4c2, P4b2, P4n2, 14m2, 14c2, 142m, I42d 




4/m mm 


D 4h 


123-142 


P4/mmm, P4/mcc, P4/nbm, P4/nnc, P4/mbm, P4/mnc, P4/nmm, P4/ncc, P4 2 /mmc, 
P4 /mem, P4 /nbc, P4 /nnm, P4 /mbc, P4 /mnm, P4 /nmc, P4 /ncm, I4/mmm, 

2 '2 2 2 2 2 2 

I4/mcm, 14 /amd, 14 /acd 


Trigonal (25) 


3 


C 3 


143-146 


P3,P3 r P3 2 , R3 




3 


S a 


147-148 


P3,R3 




32 


D 3 


149-155 


P312,P321,P3 12, P3 21, P3 12, P3 21, R32 

1 1 2 2 




3m 


Sv 


156-161 


P3ml, P31m, P3cl, P31c, R3m, R3c 




3m 


D 3d 


162-167 


P31m, P31c, P3ml, P3cl, R3m, R3c, 
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Hexagonal (27) 


6 


C 

6 


168-173 


P6, P6 1? P6. P6„P6„P6 0 

1' 5' 2' 4' 3 


6 


3h 


174 


P6 


6/m 


6h 


175-176 


P6/m, P6 /m 

3 


622 


D 

6 


177-182 


P622, P6 22, P6 22, P6 22, P6 22, P6 22 

1 5 2 4 3 


6mm 


c 

6v 


183-186 


P6mm, P6cc, P6 cm, P6 mc 

3 3 


6m2 


D OL 

3h 


187-190 


P6m2, P6c2, P62m, P62c 


6/m mm 


6h 


191-194 


P6/mmm, P6/mcc, P6 /mem, P6 /mmc 

3 3 


Cubic (36) 


23 


T 


195-199 


P23, F23, 123, P2 3, 12 3 
l l 


m3 


h 


200-206 


Pm3, Pn3, Fm3, Fd3, Im3, Pa3, Ia3 


432 


o 


207-214 


P432, P4 32, F432, F4 32, 1432, P4 32, P4 32, 14 32 

2 1 3 11 


43m 




215-220 


P43m, F43m, I43m, P43n, F43c, I43d 


m3m 


°h 


221-230 


Pm3m, Pn3n, Pm3n, Pn3m, Fm3m, Fm3c, Fd3m, Fd3c, Im3m, Ia3d 



Note. An e plane is a double glide plane, one having glides in two different directions. They are found in five space 
groups, all in the orthorhombic system and with a centered lattice. The use of the symbol e became official with 
Hahn (2002). 

The lattice system can be found as follows. If the crystal system is not trigonal then the lattice system is of the same 
type. If the crystal system is trigonal, then the lattice system is hexagonal unless the space group is one of the seven 
in the rhombohedral lattice system consisting of the 7 trigonal space groups in the table above whose name begins 
with R. (The term rhombohedral system is also sometimes used as an alternative name for the whole trigonal 
system.) The hexagonal lattice system is larger than the hexagonal crystal system, and consists of the hexagonal 
crystal system together with the 18 groups of the trigonal crystal system other than the seven whose names begin 
with R. 

The Bravais lattice of the space group is determined by the lattice system together with the initial letter of its name, 
which for the non-rhombohedral groups is P, I, F, or C, standing for the principal, body centered, face centered, or 
C-face centered lattices. 
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Noether's Theorem 



Noether's (first) theorem states that any differentiable symmetry of the action of a physical system has a 
corresponding conservation law. The theorem was proved by German mathematician Emmy Noether in 1915 and 
published in 1918. ^ The action of a physical system is the integral over time of a Lagrangian function (which may 
or may not be an integral over space of a Lagrangian density function), from which the system's behavior can be 
determined by the principle of least action. 

Noether's theorem has become a fundamental tool of modern theoretical physics and the calculus of variations. A 
generalization of the seminal formulations on constants of motion in Lagrangian and Hamiltonian mechanics (1788 
and 1833, respectively), it does not apply to systems that cannot be modeled with a Lagrangian; for example, 
dissipative systems with continuous symmetries need not have a corresponding conservation law. 

For illustration, if a physical system behaves the same regardless of how it is oriented in space, its Lagrangian is 
rotationally symmetric; from this symmetry, Noether's theorem shows the angular momentum of the system must be 
conserved. The physical system itself need not be symmetric; a jagged asteroid tumbling in space conserves angular 
momentum despite its asymmetry - it is the laws of motion that are symmetric. As another example, if a physical 
experiment has the same outcome regardless of place or time (having the same outcome, say, somewhere in Asia on 
a Tuesday or in America on a Wednesday), then its Lagrangian is symmetric under continuous translations in space 
and time; by Noether's theorem, these symmetries account for the conservation laws of linear momentum and energy 
within this system, respectively. (These examples are just for illustration; in the first one, Noether's theorem added 
nothing new - the results were known to follow from Lagrange's equations and from Hamilton's equations.) 

Noether's theorem is important, both because of the insight it gives into conservation laws, and also as a practical 
calculational tool. It allows researchers to determine the conserved quantities from the observed symmetries of a 
physical system. Conversely, it allows researchers to consider whole classes of hypothetical Lagrangians to describe 
a physical system. For illustration, suppose that a new field is discovered that conserves a quantity X. Using 
Noether's theorem, the types of Lagrangians that conserve X because of a continuous symmetry can be determined, 
and then their fitness judged by other criteria. 

There are numerous different versions of Noether's theorem, with varying degrees of generality. The original version 
only applied to ordinary differential equations (particles) and not partial differential equations (fields). The original 
versions also assume that the Lagrangian only depends upon the first derivative, while later versions generalize the 
theorem to Lagrangians depending on the n th derivative. There is also a quantum version of this theorem, known as 
the Ward-Takahashi identity. Generalizations of Noether's theorem to superspaces also exist. 

Informal statement of the theorem 

All fine technical points aside, Noether's theorem can be stated informally 

If a system has a continuous symmetry property, then there are corresponding quantities whose values are 
conserved in timeP^ 

A more sophisticated version of the theorem states that: 

To every differentiable symmetry generated by local actions, there corresponds a conserved current. 

The word "symmetry" in the above statement refers more precisely to the covariance of the form that a physical law 
takes with respect to a one-dimensional Lie group of transformations satisfying certain technical criteria. The 
conservation law of a physical quantity is usually expressed as a continuity equation. 

The formal proof of the theorem uses only the condition of invariance to derive an expression for a current 
associated with a conserved physical quantity. The conserved quantity is called the Noether charge and the flow 
carrying that 'charge' is called the Noether current. The Noether current is defined up to a solenoidal vector field. 
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Historical context 

A conservation law states that some quantity X describing a system remains constant throughout its motion; 
expressed mathematically, the rate of change of X (its derivative with respect to time) is zero: 

^ = 0. 
dt 

Such quantities are said to be conserved; they are often called constants of motion, although motion per se need not 
be involved, just evolution in time. For example, if the energy of a system is conserved, its energy is constant at all 
times, which imposes a constraint on the system's motion and may help to solve for it. Aside from the insight that 
such constants of motion give into the nature of a system, they are a useful calculational tool; for example, an 
approximate solution can be corrected by finding the nearest state that satisfies the necessary conservation laws. 

The earliest constants of motion discovered were momentum and energy, which were proposed in the 17 th century by 
Rene Descartes and Gottfried Leibniz on the basis of collision experiments, and refined by subsequent researchers. 
Isaac Newton was the first to enunciate the conservation of momentum in its modern form, and showed that it was a 
consequence of Newton's third law; interestingly, conservation of momentum still holds even in situations when 
Newton's third law is incorrect. Modern physics has revealed that the conservation laws of momentum and energy 
are only approximately true, but their modern refinements — the conservation of four-momentum in special relativity 
and the zero co variant divergence of the stress-energy tensor in general relativity - are rigorously true within the 
limits of those theories. The conservation of angular momentum, a generalization to rotating rigid bodies, likewise 
holds in modern physics. Another important conserved quantity, discovered in studies of the celestial mechanics of 
astronomical bodies, was the Laplace-Runge-Lenz vector. 

In the late 18 th and early 19 th centuries, physicists developed more systematic methods for discovering conserved 
quantities. A major advance came in 1788 with the development of Lagrangian mechanics, which is related to the 
principle of least action. In this approach, the state of the system can be described by any type of generalized 
coordinates q; the laws of motion need not be expressed in a Cartesian coordinate system, as was customary in 
Newtonian mechanics. The action is defined as the time integral / of a function known as the Lagrangian L 



dt 



where the dot over q signifies the rate of change of the coordinates q 
dq 

q= ~dt' 

Hamilton's principle states that the physical path q(t) — the one truly taken by the system - is a path for which 
infinitesimal variations in that path cause no change in /, at least up to first order. This principle results in the 
Euler-Lagrange equations 



d_ (dV\ _ dL 
dt \dcij 3q 



Thus, if one of the coordinates, say q^ does not appear in the Lagrangian, the right-hand side of the equation is zero, 
and the left-hand side shows that 

— I dL \ = dpk = Q 
dt \dqk ) dt 

where the conserved momentum p is defined as the left-hand quantity in parentheses. The absence of the coordinate 
q k from the Lagrangian implies that the Lagrangian is unaffected by changes or transformations of q^ the 
Lagrangian is invariant, and is said to exhibit a kind of symmetry. This is the seed idea from which Noether's 
theorem was born. 

Several alternative methods for finding conserved quantities were developed in the 19 th century, especially by 
William Rowan Hamilton. For example, he developed a theory of canonical transformations that allowed researchers 
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to change coordinates so that coordinates disappeared from the Lagrangian, resulting in conserved quantities. 
Another approach and perhaps the most efficient for finding conserved quantities is the Hamilton-Jacobi equation. 

Mathematical expression 

The essence of Noether's theorem is the following: Imagine that the action / defined above is invariant under small 
perturbations (warpings) of the time variable t and the generalized coordinates q; (in a notation commonly used by 
physicists) we write 

t -> t' = t + St 
q^q' = q + Jq 

where the perturbations 6t and dq are both small but variable. For generality, assume that there might be several such 
symmetry transformations of the action, say, N; we may use an index r = 1, 2, 3, . . ., N to keep track of them. Then a 
generic perturbation can be written as a linear sum of the individual types of perturbations 

5t = *£e r T r 



Using these definitions, Emmy Noether showed that the N quantities 

dL . r \ dL n 

are conserved, i.e., are constants of motion; this is a simple version of Noether's theorem. 

Examples 

For illustration, consider a Lagrangian that does not depend on time, i.e., that is invariant (symmetric) under changes 
t — » t + bt, without any change in the coordinates q. In this case, N=\, T=l and Q = 0; the corresponding 
conserved quantity is the total energy 

dL 

H=— q-L. 
dq 

Similarly, consider a Lagrangian that does not depend on a coordinate q , i.e., that is invariant (symmetric) under 
changes q — » q + bq . In that case, N=l, T=0, and Q = 1; the conserved quantity is the corresponding 
momentum/?^ 

dL 

In special and general relativity, these apparently separate conservation laws are aspects of a single conservation law, 
that of the stress-energy tensor, ^ that is derived in the next section. 

The conservation of the angular momentum L = r x p is slightly more complicated to derive, but analogous to its 
linear momentum counterpart. It is assumed that the symmetry of the Lagrangian is rotational, i.e., that the 
Lagrangian does not depend on the absolute orientation of the physical system in space. For concreteness, assume 
that the Lagrangian does not change under small rotations of an angle 69 about an axis n; such a rotation transforms 
the Cartesian coordinates by the equation 

r —> r + S6n x r. 

Since time is not being transformed, T equals zero. Taking 69 as the 8 parameter and the Cartesian coordinates r as 
the generalized coordinates q, the corresponding Q variables are given by 

Q = n x r. 

Then Noether's theorem states that the following quantity is conserved 
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— ■ Q r = p • (n x r) = n ■ (r x p) = n ■ L. 
dq 

In other words, the component of the angular momentum L along the n axis is conserved. If n is arbitrary, i.e., if the 
system is insensitive to any rotation, then every component of L is conserved; in short, angular momentum is 
conserved. 



Field-theory version 

Although useful in its own right, the version of her theorem just given was a special case of the general version she 
derived in 1915. To give the flavor of the general theorem, a version of the Noether theorem for continuous fields in 
four-dimensional space-time is now given. Since field theory problems are more common in modern physics than 
mechanics problems, this field-theory version is the most commonly used version of Noether's theorem. 

Let there be a set of differentiable fields cp defined over all space and time; for example, the temperature T(x, t) 
would be representative of such a field, being a number defined at every place and time. The principle of least action 
can be applied to such fields, but the action is now an integral over space and time 



I = J L(0, d^fr x*) d A 2 



(the theorem can actually be further generalized to the case where the Lagrangian depends on up to the n th derivative 
using jet bundles) 

Let the action be invariant under certain transformations of the space-time coordinates and the fields cp^ 
aJ* _> x v + Sx^ 
(j>^<p + 8<j> 

where the transformations can be indexed by r = 1, 2, 3, . . ., N 
8x» = e r X? 
6<j> = e r V r 

For such systems, Noether's theorem states that there are N conserved current densities 

dL 



d<t>,„. 



<t>.<, - LK 



In such cases, the conservation law is expressed in a four-dimensional way 

which expresses the idea that the amount of a conserved quantity within a sphere cannot change unless some of it 
flows out of the sphere. For example, electric charge is conserved; the amount of charge within a sphere cannot 
change unless some of the charge leaves the sphere. 

For illustration, consider a physical system of fields that behaves the same under translations in time and space, as 
considered above; in other words, the fields do not depend on the absolute position in space and time. In that case, 
N=4, one for each dimension of space and time. Since only the positions in space-time are being warped, not the 

fields, the *P are all zero and the X v equal the Kronecker delta 6 v , where we have used u instead of r for the index. 

* ^ ' v r5i 

In that case, Noether's theorem corresponds to the conservation law for the stress-energy tensor T 



rp i/ 
f 1 



' dL s 
Mr. 



' dL s 
Mr. 



0 



The conservation of electric charge can be derived by considering transformations of the fields themselves. In 
quantum mechanics, the probability amplitude ip(x) of finding a particle at a point x is a complex field, because it 
ascribes a complex number to every point in space and time. The probability amplitude itself is physically 
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2 

unmeasurable; only the probability p = li^l is directly measureable. Therefore, the system is invariant under 
transformations of the ip field and its complex conjugate field ip* that leave IiJjI 2 unchanged, such as 

In the limit when 9 becomes infinitesimally small (69), it may be taken as the 8, and the *P are equal to hp and -hp*, 
respectively. A specific example is the Klein-Gordon equation, the relativistically correct version of the Schrodinger 
equation for spinless particles, which has the Lagrangian density 

In this case, Noether's theorem states that the conserved current equals 

which, when multiplied by the charge, equals the electric current density. This transformation was first noted by 
Hermann Weyl and is one of the fundamental gauge symmetries of modern physics. 



= [ 2 L[q[t],q[t],t]dt 



Derivations 

One independent variable 

Consider the simplest case, a system with one independent variable, time. Suppose the dependent variables qare 
such that the action integral 

I 

is invariant under brief infinitesimal variations in the dependent variables. In other words, they satisfy the 
Euler-Lagrange equations 

ddL ri 5L ri 

= a# 

And suppose that the integral is invariant under a continuous symmetry. Mathematically such a symmetry is 
represented as a flow, (j) , which acts on the variables as follows 

t -> t' = t + eT 

q[t] -> q'[f ] = *[q[t], e] = 0[q[t' - eT], e] 
where e is a real variable indicating the amount of flow and T is a real constant (which could be zero) indicating 
how much the flow shifts time. 

m - m = j t mu\ = - *n w - *n 

The action integral flows to 



i f [e\= / LwnmAdt 

Jtj+eT 

-J 



ti+eT 
t 2 +eT 



L[0[q[t' - eT], e], [q[f - eT], e ]q[f - eT], H] dt' 

tl +eT <*1 



which may be regarded as a function of e. Calculating the derivative at g = Oand using the symmetry, we get 
0= ^[0] =L[q[t 2 ] 1 q[t 2 ] ) t 2 ]T-L[q[i 1 ],q[t 1 ],t 1 ]r 

f t2 dL ( d(j>. m d</>\ dL ( d 2 <f> . 2m d 2 <j> . d<j>.\ , 



(dq) 2 ^ de,9q M 9q 
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Notice that the Euler— Lagrange equations imply 
d fdLd<p \ ( ddL\ d<f> 



AT = 



qT + — (—?$\ qT+ — — qT 
dq \dtdqj dq dq 



dt \dqdq ) \dtdqj dq 

dLdcj).^ dL ( d 2 <f) . \ . _ dL d<f> .. _ 

= dq~d^ T+ dq~ {(d^V qT+ dq~^ 

Substituting this into the previous equation, one gets 

0 = ^[0] =L[q[« 2 ] )q [t 2 ] } t 2 ]r-L[ q [t 1 ] }q [t 1 ] ) i 1 ]T- g^q^T + ^^q^r 

f t2 dL d<h dL d 2 <h . 

+ / H —qdt. 

J tl dq de dq dedq 

Again using the Euler— Lagrange equations we get 

d / dLd^\ _ ( ddV\ d$ dL d 2 <j> . _dLd(f> dL d 2 ^ . 
dt\dqdej \dtdqj de dqdedq^ dq de dqdedq^' 

Substituting this into the previous equation, one gets 

0 = L[q[f 2 ], qfe], t 2 ]T - L[q[h], qfr], t,]T - ^^q[t 2 ]T + ^^q[ tl ]T 
dLdcj> r . dLd<j>, . 

From which one can see that 



\ <9q dq J dq de 



r i d <t> * 

is a constant of the motion, i.e. a conserved quantity. Since 0[q, OJ = q, we get ^— = land so the conserved 

quantity simplifies to 



(i 

\ dq ) dq de 

To avoid excessive complication of the formulas, this derivation assumed that the flow does not change as time 
passes. The same result can be obtained in the more general case. 

Field-theoretic derivation 

Noether's theorem may also be derived for tensor fields cp A where the index A ranges over the various components of 
the various tensor fields. These field quantities are functions defined over a four-dimensional space whose points are 
labeled by coordinates where the index \x ranges over time (\i=0) and three spatial dimensions (|i=l,2,3). These 
four coordinates are the independent variables; and the values of the fields at each event are the dependent variables. 
Under an infinitesimal transformation, the variation in the coordinates is written 

x fi ^^ = x fi + 8x* 
whereas the transformation of the field variables is expressed as 

By this definition, the field variations Sep result from two factors: intrinsic changes in the field themselves and 
changes in coordinates, since the transformed field a depends on the transformed coordinates § . To isolate the 
intrinsic changes, the field variation at a single point may be defined 

a A {x*) = <j> A {x^) + S^ix*) . 
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If the coordinates are changed, the boundary of the region of space-time over which the Lagrangian is being 
integrated also changes; the original boundary and its transformed version are denoted as £1 and £l\ respectively. 

Noether's theorem begins with the assumption that a specific transformation of the coordinates and field variables 
does not change the action, which is defined as the integral of the Lagrangian density over the given region of 
spacetime. Expressed mathematically, this assumption may be written as 

where the comma subscript indicates a partial derivative with respect to the coordinate(s) that follows the comma, 
e.g. 

Since § is a dummy variable of integration, and since the change in the boundary £1 is infinitesimal by assumption, 
the two integrals may be combined using the four-dimensional version of the divergence theorem into the following 
form 

jf | [L (a A , a A u , x») - L (f, <j> A >v , *")] + A [l (<I> a , <j> A ^ *") 5x°] J <fix = 0 . 
The difference in Lagrangians can be written to first-order in the infinitesimal variations as 

[L [ a \a\, s") - L <„, *)] = «V + g-Stf 



d(j) A * d(j> A * CT 

However, because the variations are defined at the same point as described above, the variation and the derivative 
can be done in reverse order; they commute 

dx° dx a V v I 
Using the Euler-Lagrange field equations 

d ( dL \ dL 



dx a \d(f) A a ) d(f) A 
the difference in Lagrangians can be written neatly as 



Thus, the change in the action can be written as 



Since this holds for any region Q., the integrand must be zero 

d 
dx a 

For any combination of the various symmetry transformations, the perturbation can be written 
6xP = eX" 

8(j) A = e^ A = ~8<t> A + e£, x <f> A 
where (Lx^> A * s me Lie derivative of (j) A in the X^ direction. When (j) A is a scalar or X^ jV = 0, 

v dx» 

These equations imply that the field variation taken at one point equals 
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8<t> A = e^ A - eC x (j> A . 

Differentiating the above divergence with respect to 8 at 8=0 and changing the sign yields the conservation law 
d 



r = o 

lc 

£ X (j) A -LX a 



where the conserved current equals 
dL 



dJ> A , 



Manifold/fiber bundle derivation 

Suppose we have an ^-dimensional oriented Riemannian manifold, M and a target manifold T. Let C be the 
configuration space of smooth functions from M to T. (More generally, we can have smooth sections of a fiber 
bundle over M.) 

Examples of this M in physics include: 

• In classical mechanics, in the Hamiltonian formulation, M is the one-dimensional manifold R, representing time 
and the target space is the cotangent bundle of space of generalized positions. 

• In field theory, M is the spacetime manifold and the target space is the set of values the fields can take at any 
given point. For example, if there are m real- valued scalar fields, cp 1? . . -,<P m , then the target manifold is R m . If the 
field is a real vector field, then the target manifold is isomorphic to R 3 . 

Now suppose there is a functional 

S : C -> R 5 

called the action. (Note that it takes values into JJ_ , rather than C \ tms * s f° r physical reasons, and doesn't really 
matter for this proof.) 

To get to the usual version of Noether's theorem, we need additional restrictions on the action. We assume S[(j)] is 
the integral over M of a function 
^0, x) 

called the Lagrangian density, depending on cp, its derivative and the position. In other words, for cp in Q 



s[4>] = [ C[4>{x),dr4>(x),z]d n x. 

Jm 



' M 

Suppose we are given boundary conditions, ie., a specification of the value of 0 at the boundary if M is compact, or 
some limit on 0 as x approaches ©o. Then the subspace of Q consisting of functions 0 such that all functional 
derivatives of S at cp are zero, that is: 

8<f>{x) 

and that 0 satisfies the given boundary conditions, is the subspace of on shell solutions. (See principle of stationary 
action) 

Now, suppose we have an infinitesimal transformation on Q » generated by a functional derivation, Q such that 



/ Cd n x\ [ 
Jn J Jd 



q I Cd n x\ k I ^[0(x) ? a0,aa0,...]d 5p 



tdN 

for all compact submanifolds N or in other words, 

Q[C(x)} « dpfHx) 

for all x, where we set 

C(x) = jC[0(x), 9^0(x), x]. 



Noether's Theorem 



379 



If this holds on shell and off shell, we say Q generates an off- shell symmetry. If this only holds on shell, we say Q 
generates an on-shell symmetry. Then, we say Q is a generator of a one parameter symmetry Lie group. 

Now, for any N, because of the Euler-Lagrange theorem, on shell (and only on-shell), we have 



Q 



/ . 

JdN 



Since this is true for any N, we have 

dC 



WW) 



Q[d>] - r 



0. 



But this is the continuity equation for the current defined by:^ 

DC 



J* = 



Q[d>] - 



which is called the Noether current associated with the symmetry. The continuity equation tells us that if we 
integrate this current over a space-like slice, we get a conserved quantity called the Noether charge (provided, of 
course, if M is noncompact, the currents fall off sufficiently fast at infinity). 



Comments 

Noether's theorem is really a reflection of the relation between the boundary conditions and the variational principle. 
Assuming no boundary terms in the action, Noether's theorem implies that 

/ J^ds^ w 0. 

JdN 

Noether's theorem is an on shell theorem. The quantum analog of Noether's theorem are the Ward-Takahashi 
identities. 



Generalization to Lie algebras 

Suppose say we have two symmetry derivations and <2 2 - Then, [Q^, is also a symmetry derivation. Let's see 
this explicitly. Let's say 

and 

Then, 

[Oil Q*m = Ol[0 2 [£]] - 0a[0l[4] ~ Wu 

where fj^QJf/j-Q^f^]. So, 

ifi = (d^jf) (Oi[Q 2 [0]] - OalOiMD - /r 2 - 

This shows we can extend Noether's theorem to larger Lie algebras in a natural way. 
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Generalization of the proof 

This applies to any local symmetry derivation Q satisfying QS ~ 0 , and also to more general local functional 

differentiable actions, including ones where the Lagrangian depends on higher derivatives of the fields. Let 8 be any 

arbitrary smooth function of the spacetime (or time) manifold such that the closure of its support is disjoint from the 

boundary. 8 is a test function. Then, because of the variational principle (which does not apply to the boundary, by 

the way), the derivation distribution q generated by q[8][0(x)] = 8(x)Q[0(x)] satisfies g[e][5'] ~ Ofor any 8, or 

more compactly, q(x) [S] ~ Ofor all x not on the boundary (but remember that q(x) is a shorthand for a derivation 

distribution, not a derivation parametrized by x in general). This is the generalization of Noether's theorem. 

To see how the generalization related to the version given above, assume that the action is the spacetime integral of a 

Lagrangian that only depends on 0 and its first derivatives. Also, assume 

Q[£] « dpf" 

Then, 



J q[e][C]d n x 



+ 



d 



1%) 



d^Q[<t>\) 



d 



5>] | d"rc 



for all e. 

More generally, if the Lagrangian depends on higher derivatives, then 



r- 



Q[<t>] - 2 



6(3^,8^) 



d v Q[<j)} + 8 V 



d 



d{d,j,d v 4>) 



Q[(f>\ 



Examples 



Example 1: Conservation of energy 

Looking at the specific case of a Newtonian particle of mass m, coordinate x, moving under the influence of a 
potential V, coordinatized by time t. The action, S, is: 



S[x] = J L[x(t),x(t)]dt 



^±?-V{x{t)))dt. 



i=l 

Consider the generator of time translations Q = d/dt . In other words, Q[x(t)] = x(t) . Note that x has an 
explicit dependence on time, whilst V does not; consequently: 

m 



Q[L}=m^i i x l -Y:^ 1 ii = f t 



so we can set 

777, 



Then, 
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m 



52$- V(x) 



The right hand side is the energy and Noether's theorem states that j = Q(i.e. the principle of conservation of 

energy is a consequence of invariance under time translations). 

More generally, if the Lagrangian does not depend explicitly on time, the quantity 

i=l ux * 

(called the Hamiltonian) is conserved. 

Example 2: Conservation of center of momentum 

Still considering 1 -dimensional time, let 



S[x\ = J C[x{t),S{t)} dt 

-I 



dt 



i.e. N Newtonian particles where the potential only depends pairwise upon the relative displacement. 
— * 

For Q , let's consider the generator of Galilean transformations (i.e. a change in the frame of reference). In other 
words, 

QAxm = ts{. 

Note that 

Qi[C] = y^m^X^ - Y d i V cx{3(X{3 - x a )(t - t) 



OL<(3 



This has the form of — / m a x a 



so we can set 



Then, 



/ = ^2m a x a . 

OL 

J-E(±*)-«w-/ 

= y2{m a x a t — m a x a ) 

a 

= Pt- Mx CM 

where pis the total momentum, M is the total mass and Xqm * s me center of mass. Noether's theorem states: 



j = 0 P - Mx CM = 0. 
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Example 3: Conformal transformation 

Both examples 1 and 2 are over a 1 -dimensional manifold (time). An example involving spacetime is a conformal 
transformation of a massless real scalar field with a quartic potential in (3 + l)-Minkowski spacetime. 



^ = J £[0(x),^0(x)]d 4 x 

= J Q#W-A0 4 ) d 4 



For g, consider the generator of a spacetime rescaling. In other words, 

Q[0(x)] =a^a #l 0(x) + 0(ar). 
The second term on the right hand side is due to the "conformal weight" of cp. Note that 

Q[C] = (d^ + x v dpd v 4> + d^) - 4A0 3 (afd^ + <t>) . 



This has the form of 
1 



9, 



-x"d v (j)d v (j) - A^0 4 



r = 



-c 



Q[<f>] - /" 



(where we have performed a change of dummy indices) so set 
Then, 

a 

= d»<j) {x v d u <j> + 0) - (^d v <l>d u <f> - A0 4 j . 

Noether's theorem states that d^j^ = 0(as one may explicitly check by substituting the Euler-Lagrange equations 
into the left hand side). 

(Aside: If one tries to find the Ward-Takahashi analog of this equation, one runs into a problem because of 
anomalies.) 

Applications 

Application of Noether's theorem allows physicists to gain powerful insights into any general theory in physics, by 
just analyzing the various transformations that would make the form of the laws involved invariant. For example: 

• the invariance of physical systems with respect to spatial translation (in other words, that the laws of physics do 
not vary with locations in space) gives the law of conservation of linear momentum; 

• invariance with respect to rotation gives the law of conservation of angular momentum; 

• invariance with respect to time translation gives the well-known law of conservation of energy 

In quantum field theory, the analog to Noether's theorem, the Ward-Takahashi identity, yields further conservation 
laws, such as the conservation of electric charge from the invariance with respect to a change in the phase factor of 
the complex field of the charged particle and the associated gauge of the electric potential and vector potential. 

rm 

The Noether charge is also used in calculating the entropy of stationary black holes. 
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Goldstone theorem 



In particle and condensed matter physics, Goldstone bosons (also known as Nambu-Goldstone bosons) are bosons 
that appear necessarily in models exhibiting spontaneous breakdown of continuous symmetries (cf. spontaneously 
broken symmetry). They were discovered by Nambu in the context of the BCS superconductivity mechanism, 1 ^ and 
subsequently elucidated by Goldstone, and systematically generalized in the context of quantum field theory. 

These spinless bosons correspond to the spontaneously broken internal symmetry generators, and are characterized 
by the quantum numbers of these. They transform nonlinearly (shift) under the action of these generators, and can 
thus be excited out of the asymmetric vacuum by these generators. Thus, they can be thought of as the excitations of 
the field in the broken symmetry directions in group space — and are massless if the spontaneously broken symmetry 
is not also broken explicitly. If, instead, the symmetry is not exact, i.e., if it is explicitly broken as well as 
spontaneously broken, then the Nambu-Goldstone bosons are not massless, though they typically remain relatively 
light; they are then called pseudo-Goldstone bosons or pseudo-Nambu-Goldstone bosons (abbreviated PNGBs). 

Goldstone f s theorem 

Goldstone's theorem examines a generic continuous symmetry which is spontaneously broken; i.e., its currents are 
conserved, but the ground state (vacuum) is not invariant under the action of the corresponding charges. Then, 
necessarily, new massless (or light, if the symmetry is not exact) scalar particles appear in the spectrum of possible 
excitations. There is one scalar particle — called a Nambu-Goldstone boson — for each generator of the symmetry that 
is broken; i.e., that does not preserve the ground state. The Nambu-Goldstone mode is a long- wavelength fluctuation 
of the corresponding order parameter. 

There is an arguable loophole in the theorem. If one reads the theorem carefully, it only states that there exist 
non- vacuum states with arbitrarily small energies. Take for example a chiral N = 1 super QCD model with a nonzero 
squark VEV which is conformal in the IR. The chiral symmetry is a global symmetry which is (partially) 
spontaneously broken. Some of the "Goldstone bosons" associated with this spontaneous symmetry breaking are 
charged under the unbroken gauge group and hence, these composite bosons have a continuous mass spectrum with 
arbitrarily small masses but yet there is no Goldstone boson with exactly zero mass. In other words, the Goldstone 
bosons are infraparticles. 

In theories with gauge symmetry, the Goldstone bosons are "eaten" by the gauge bosons. The latter become massive 
and their new, longitudinal polarization is provided by the Goldstone boson. 

A simple example 

Consider a complex scalar field cp (phi), with the constraint that cp*cp = k 2 . One way to get a constraint of this sort is 
by including a potential 

A 2 (0*0-fc 2 ) 2 , 

and taking the limit as X —> °o (this is called the Abelian nonlinear a-model). 

The constraint, and the action, below, are invariant under a U(l) phase transformation, Sep = /ecp. The field can be 
redefined to give a real scalar field (i.e., a spin-zero particle) 9 without any constraint by 

<j) = ke ie 

where 9 is the Nambu-Goldstone boson (actually k9 is), and the U(l) symmetry transformation effects a shift on 9, 
namely 69 = 8, but does not preserve the ground state 10) , (i.e. the above infinitesimal transformation does not 
annihilate it — the hallmark of invariance), as evident in the charge of the current below. The vacuum is degenerate 
and nonin variant under the spontaneously broken symmetry. 

The corresponding Lagrangian density is given by 
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C = -^(5^0*)^0+m 2 0*0 = -l(-ike- ie d*d){ikj%d)+m 2 k 2 = -^-(d fi 6)(d fi e)+m 2 k 2 , 

2 

and the symmetry-induced conserved U(l) current is / = - k 3 9 . The charge resulting from this current, Q , shifts 
9 and the ground state to a new, degenerate, ground state. Thus, a vacuum with (9)= 0 will shift to a different 
vacuum with D(90= - s. The current connects the original vacuum with the Nambu- Golds tone state, 001 ^ 0 (0) I0D * 0. 

Note that the constant term m 2 k 2 in the Lagrangian density has no physical significance, and the other term in it is 
simply the kinetic term for a massless scalar. 

In general, in a theory with several scalar fields, (p., the Nambu-Goldstone mode cp is massless, and parameterises 
the curve of possible (degenerate) vacuum states. Its hallmark under the broken symmetry transformation is 
nonvanishing vacuum expectation (6qp ), an order parameter, for vanishing ((jp )=0, at some ground state 10) chosen 
at the minimum of the potential, ( 3 V / 3cp.)=0. 

This nonvanishing vacuum expectation of the transformation increment, (S(jp g ), specifies the relevant (Goldstone) 
null eigenvector of the mass matrix, { 3 2 V / 3cp. 3cp ) <6cp )= 0, and hence the corresponding zero-mass eigenvalue. 

Goldstone f s Argument 

The principle behind Goldstone' s argument is that the ground state is not unique. Normally, by current conservation, 
the charge operator for any symmetry current is time-independent, 

Acting with the charge operator on the vacuum either annihilates the vacuum, if that is symmetric; else, if not, as is 
the case in spontaneous symmetry breaking, it produces a zero-frequency state out of it, through its shift 
transformation feature illustrated above. (Actually, here, the charge itself is ill-defined; but its better behaved 
commutators with fields, so, then, the transformation shifts, are still time-invariant, d(bop^/dt = 0 .) 

So, if the vacuum is not invariant under the symmetry, action of the charge operator produces a state which is 
different from the vacuum chosen, but which has zero frequency. This is a long-wavelength oscillation of a field 
which is nearly stationary: there are states with zero frequency, so that the theory cannot have a mass gap. 

This argument is clarified by taking the limit carefully. If an approximate charge operator is applied to the vacuum, 

T*Q* = 12 f e^J°(x) = - f e^V J= I V(efS) • J , 
at at Jx Jx Jx 

a state with approximately vanishing time derivative is produced, 

I||oa|o>||»^||Oa|o>||. 

Assuming a mass gap mo, the frequency of any state like the above, which is orthogonal to the vacuum, is at least 
mo, 

ii|iv)ii = iw)ii>m 0 |i m . 

Letting A become large leads to a contradiction. 

This argument fails, however, when the symmetry is gauged, because then the symmetry generator is only 
performing a gauge transformation. A gauge transformed state is the same exact state, so that acting with a symmetry 
generator does not get one out of the vacuum. 
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Nonrelativistic theories 

A version of Goldstone' s theorem also applies to nonrelativistic theories (and also relativistic theories with 
spontaneously broken spacetime symmetries, such as Lorentz symmetry or conformal symmetry, rotational, or 
translational in variance). 

It essentially states that, for each spontaneously broken symmetry, there corresponds some quasiparticle with no 
energy gap — the nonrelativistic version of the mass gap. (Note that the energy here is really JJ — ^JV — 5 • P 

and not JJ .) However, two different spontaneously broken generators may now give rise to the same 
Nambu-Goldstone boson. For example, in a superfluid, both the U(l) particle number symmetry and Galilean 
symmetry are spontaneously broken. However, the phonon is the Goldstone boson for both. 

In general, the phonon is effectively the Nambu-Goldstone boson for spontaneously broken Galilean/Lorentz 
symmetry. However, in contrast to the case of internal symmetry breaking, when spacetime symmetries are broken, 
the order parameter need not be a scalar field, but may be a tensor field, and the corresponding independent massless 
modes may now be fewer than the number of spontaneously broken generators, because the Goldstone modes may 
now be linearly dependent among themselves: e.g., the Goldstone modes for some generators might be expressed as 
gradients of Goldstone modes for other broken generators. 

Nambu-Goldstone fermions 

Spontaneously broken global fermionic symmetries, which occur in some supersymmetric models, lead to 
Nambu-Goldstone fermions, or goldstinos^ ^ These have spin V2, instead of 0, and carry all quantum numbers of 
the respective supersymmetry generators broken spontaneously. 

(Vestigial bosonic superpartners of these goldstinos under supersymmetries, called sgoldstinos, might sometimes 
also appear. Strictly speaking, however, spontaneous symmetry, and supersymmetry, breaking smashes up 
("reduces") multiplet and supermultiplet structures, into the characteristic nonlinear realizations of broken 
supersymmetry, so that, in a technical sense, goldstinos are superpartners of all particles in the theory, of any spin.) 

Goldstone bosons in nature 

• In fluids, the phonon is longitudinal and it is the Goldstone boson of the spontaneously broken Galilean 
symmetry. In solids, the situation is more complicated; the Goldstone bosons are the longitudinal and transverse 
phonons and they happen to be the Goldstone bosons of spontaneously broken Galilean, translational, and 
rotational symmetry with no simple one-to-one correspondence between the Goldstone modes and the broken 
symmetries. 

• In magnets, the original rotational symmetry (present in the absence of an external magnetic field) is 
spontaneously broken such that the magnetization points into a specific direction. The Goldstone bosons then are 
the magnons, i.e., spin waves in which the local magnetization direction oscillates. 

• The pions are the pseudo-Goldstone bosons that result from the spontaneous breakdown of the chiral-flavor 
symmetries of QCD effected by quark condensation due to the strong interaction. These symmetries are further 
explicitly broken by the masses of the quarks, so that the pions are not massless, but their mass is significantly 
smaller than typical hadron masses. 

• The longitudinal polarization components of the W and Z bosons correspond to the Goldstone bosons of the 
spontaneously broken part of the electroweak symmetry SU(2)®U(1). Because this symmetry is gauged, the three 
would-be Goldstone bosons are "eaten" by the three gauge bosons corresponding to the three broken generators; 
this gives these three gauge bosons a mass, and the associated necessary third polarization degree of freedom. 
This is described in the Standard Model through the Higgs mechanism. 
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In mathematics and in theoretical physics, the Stone-von Neumann theorem is any one of a number of different 
formulations of the uniqueness of the canonical commutation relations between position and momentum operators. 
The name is for Marshall Stone and von Neumann (1931). 

Trying to represent the commutation relations 

In quantum mechanics, physical observables are represented mathematically by linear operators on Hilbert spaces. 
For a single particle moving on the real line R, there are two important observables: position and momentum. In the 
quantum-mechanical description of such a particle, the position operator Q and momentum operator P are 
respectively given by 



on the domain V of infinitely differentiable functions of compact support on R. We assume ft is a fixed non-zero 
real number — in quantum theory ft is (up to a factor of 2jt) Planck's constant, which is not dimensionless; it takes 
a small numerical value in terms of units in the macroscopic world. The operators P, Q satisfy the commutation 
relation 



Already in his classic volume, Hermann Weyl observed that this commutation law was impossible for linear 
operators P, Q acting on finite dimensional spaces (as is clear by applying the trace of a matrix), unless ft vanishes. 
A little analysis shows that in fact any two self-adjoint operators satisfying the above commutation relation cannot be 
both bounded. 

Uniqueness of representation 

One would like to classify representations of the canonical commutation relation by two self-adjoint operators acting 
on separable Hilbert spaces, up to unitary equivalence. By Stone's theorem, there is a one-to-one correspondence 
between self-adjoint operators and (strongly continuous) one parameter unitary groups. Let Q and P be two 
self-adjoint operators satisfying the canonical commutation relation, and e ir ^ and e ls P be the corresponding unitary 
groups given by functional calculus. A formal computation with power series shows that 
tftQjsP _ e ist e isPjtQ _ Q 

Conversely, given two one parameter unitary groups U(t) and V(s) satisfying the relation 
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[Qip](x) = xtp(x) 
[P4>](x) = 



QP-PQ = 




U{t)V{s) = e ist V{s)U{t) Va, t : 



(*) 
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formally differentiating at 0 shows that the two infinitesmal generators satisfy the canonical commutation relation. 
These formal calculations can be made rigorous. Therefore there is a one-to-one correspondence between 
representations of the canonical commutation relation and two one parameter unitary groups U(t) and V(s) satisfying 
(*). This formulation of the canonical commutation relations for one parameter unitary groups is called the Weyl 
form of the CCR. 

The problem now thus becomes classifying two jointly irreducible one parameter unitary groups U(t) and V(s) 
satisfying the Weyl relation on separable Hilbert spaces. The answer is the content of the Stone-von Neumann 
theorem: all such pairs of one parameter unitary groups are unitarily equivalent. In other words, for any two such 
U(t) and V(s) acting jointly irreducibly on a Hilbert space H, there is a unitary operator 

W : L 2 (M) -+H 

so that 

W*U{t)W = e isQ and W*V{t)W = e isP , 
where P and Q are the position and momentum operators from above. 

Historically this result was significant because it was a key step in proving that Heisenberg's matrix mechanics which 
presents quantum mechanical observables and dynamics in terms of infinite matrices, is unitarily equivalent to 
Schrodinger's wave mechanical formulation (see Schrodinger picture). 

Representation theory formulation 

In terms of representation theory, the Stone— von Neumann theorem classifies certain unitary representations of the 
Heisenberg group. This is discussed in more detail in the Heisenberg group section, below. 

Informally stated, with certain technical assumptions, every representation of the Heisenberg group i?2n+li s 
equivalent to the position operators and momentum operators on R^. Alternatively, that they are all equivalent to the 
Weyl algebra (or CCR algebra) on a symplectic space of dimension 2n. 

More formally, there is a unique (up to scale) non-trivial central strongly continuous unitary representation. 

This was later generalized by Mackey theory - and was the motivation for the introduction of the Heisenberg group 
in quantum physics. 

In detail: 

2n 

• The continuous Heisenberg group is a central extension of the abelian Lie group R by a copy of R, 

• the corresponding Heisenberg algebra is a central extension of the abelian Lie algebra R (with trivial bracket) 
by a copy of R, 

• the discrete Heisenberg group is a central extension of the free abelian group Z by a copy of Z, and 

• the discrete Heisenberg group module p is a central extension of the free abelian p-group (Z/pZ) 2n by a copy of 
Z/pZ. 

These are thus all semidirect product, and hence relatively easily understood. In all cases, if one has a representation 
H _> J[ where the center maps to zero, then one simply has a representation of the corresponding abelian group or 
algebra, which is Fourier theory. 

If the center does not map to zero, one has a more interesting theory, particularly if one restricts oneself to central 
representations. 

Concretely, by a central representation one means a representation such that the center of the Heisenberg group maps 
into the center of the algebra: for example, if one is studying matrix representations or representations by operators 
on a Hilbert space, then the center of the matrix algebra or the operator algebra is the scalar matrices. Thus the 
representation of the center of the Heisenberg group is determined by a scale value, called the quantization value (in 
physics terms, Planck's constant), and if this goes to zero, one gets a representation of the abelian group (in physics 
terms, this is the classical limit). 
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More formally, the group algebra of the Heisenberg group K [H] has center j^T [R] 5 so rather than simply thinking 
of the group algebra as an algebra over the field of scalars K f one may think of it as an algebra over the commutative 
algebra i(T[R]. As the center of a matrix algebra or operator algebra is the scalar matrices, a K [R] -structure on 
the matrix algebra is a choice of scalar matrix - a choice of scale. Given such a choice of scale, a central 
representation of the Heisenberg group is a map of K [R] -algebras if [J?] — > A, which is the formal way of 
saying that it sends the center to a chosen scale. 

Then the Stone-von Neumann theorem is that, given a quantization value, every strongly continuous unitary 
representation is unitarily equivalent to the standard representation as position and momentum. 

Reformulation via Fourier transform 

A 

Let G be a locally compact abelian group and G be the Pontryagin dual of G. The Fourier-Plancherel transform 
defined by 

/ >-> /(7) = / W)f(t)dKt) 
Jg 

A 

extends to a C* -isomorphism from the group C*-algebra C*(G) of G and C Q (G ), i.e. the spectrum of C*(G) is 
precisely G . When G is the real line R, this is Stone's theorem characterizing one parameter unitary groups. The 
theorem of Stone-von Neumann can also be restated using similar language. 

The group G acts on the C*-algebra C Q (G) by right translation g: for s in G and/in C Q (G), 
(s-f)(t) = f(t + s). 

A 

Under the isomorphism given above, this action becomes the natural action of G on C*(G ): 

H)(7) = 7(*. 

So a covariant representation corresponding to the C*-crossed product 

C*(G) G 

A 

is a unitary representation U(s) of G and V(y) of G such that 

U(s)V( 1 )U*(s)= 1 ( S )V( 1 ). 

It is a general fact that covariant representations are in one-to-one correspondence with * -representation of the 
corresponding crossed product. On the other hand, all irreducible representations of 

C 0 (G) G 

2 2 

are unitarily equivalent to the K(L( G)), the compact operators on L( G)). Therefore all pairs {U(s) 9 V(y)} are 
unitarily equivalent. Specializing to the case where G = R yields the Stone-von Neumann theorem. 



The Heisenberg group 



The commutation relations for P, Q look very similar to the commutation relations that define the Lie algebra of 
gener; 
form 



general Heisenberg group for n a positive integer. This is the Lie group of (rc+2) x (n+2) square matrices of the 



lac 
M(a, 6, c) = 0 l n b 
0 0 1 

In fact, using the Heisenberg group, we can formulate a far-reaching generalization of the Stone von Neumann 
theorem. Note that the center of H consists of matrices M(0, 0, c). 



Theorem. For each non-zero real number h there is an irreducible representation JJ ^ acting on the Hilbert space 
[U h (M(a, b, c))]1>(x) = ^ bx+hc ^(x + ha). 



L 2 (R") by 
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All these representations are unitarily inequivalent and any irreducible representation which is not trivial on the 
center of is unitarily equivalent to exactly one of these. 

Note that £/ is a unitary operator because it is the composition of two operators which are easily seen to be unitary: 
the translation to the left by h a and multiplication by a function of absolute value 1 . To show U h is multiplicative is 
a straightforward calculation. The hard part of the theorem is showing the uniqueness which is beyond the scope of 
the article. However, below we sketch a proof of the corresponding Stone-von Neumann theorem for certain finite 
Heisenberg groups. 

In particular, irreducible representations jt, Jt' of the Heisenberg group H^ which are non-trivial on the center of H^ 
are unitarily equivalent if and only if Jt(z) = Jt'(z) for any z in the center of H^. 

One representation of the Heisenberg group that is important in number theory and the theory of modular forms is 
the theta representation, so named because the Jacobi theta function is invariant under the action of the discrete 
subgroup of the Heisenberg group. 

Relation to the Fourier transform 

For any non-zero h, the mapping 

a h : M(a 5 6, c) — > M(-fr _1 b 5 ha, c - ab) 
is an automorphism of H^ which is the identity on the center of H^. In particular, the representations U and U a are 
unitarily equivalent. This means that there is a unitary operator W on L 2 (R") such that for any g in H^, 

WU h (g)W* = U h a(g). 

Moreover, by irreducibility of the representations £7^, it follows that up to a scalar, such an operator W is unique (cf . 
Schur's lemma). 

2 n 

Theorem. The operator Wis, up to a scalar multiple, the Fourier transform on L (R ). 
This means that (ignoring the factor of (2 Jt) in the definition of the Fourier transform) 

J e- ixp e^ bx ^ hc ^(x + ha) dx = e *(^*+Mc-M) / e -*-(p->ty( tf ) dy. 

The previous theorem can actually be used to prove the unitary nature of the Fourier transform, also known as the 
Plancherel theorem. Moreover, note that 

(a h ) 2 M(a, 6, c) = M(-a, -6, c). 
Theorem. The operator W 1 such that 

W 1 U h W* = U h a\g) 
is the reflection operator 

[W^](x) = xP(-x). 
From this fact the Fourier inversion formula easily follows. 

Representations of finite Heisenberg groups 

The Heisenberg group H^(K) is defined for any commutative ring K. In this section let us specialize to the field K = 
Z/p Z for p a prime. This field has the property that there is an imbedding oo of K as an additive group into the circle 
group T. Note that H^(K) is finite with cardinality IKI 2 n+1 . For finite Heisenberg group H^(K) one can give a simple 
proof of the Stone-von Neumann theorem using simple properties of character functions of representations. These 
properties follow from the orthogonality relations for characters of representations of finite groups. 

2 n 

For any non-zero h in K define the representation U on the finite-dimensional inner product space / (K ) by 

[U h M(a, 6, c)ijj\{x) — u;(b ■ x + hc)^(x + ha). 
Theorem. For a fixed non-zero h, the character function x of U is given by: 
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. (\K\ n u(hc) ifa = b=0 

X (M(a,6 |C )) = |' ^ > otterw . 8e 

It follows that 

imJ^ )|,= i5^ |KnK|=1 - 

By the orthogonality relations for characters of representations of finite groups this fact implies the corresponding 
Stone-von Neumann theorem for Heisenberg groups H (Z//? Z), particularly: 

• Irreducibility of £/ 

• Pairwise inequivalence of all the representations £7^. 

Generalizations 

The Stone— von Neumann theorem admits numerous generalizations. Much of the early work of George Mackey was 
directed at obtaining a formulation of the theory of induced representations developed originally by Frobenius for 
finite groups to the context of unitary representations of locally compact topological groups. 
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Peter- Weyl theorem 

In mathematics, the Peter-Weyl theorem is a basic result in the theory of harmonic analysis, applying to 
topological groups that are compact, but are not necessarily abelian. It was initially proved by Hermann Weyl, with 
his student Fritz Peter, in the setting of a compact topological group G (Peter & Weyl 1927). The theorem is a 
collection of results generalizing the significant facts about the decomposition of the regular representation of any 
finite group, as discovered by F. G. Frobenius and Issai Schur. 

The theorem has three parts. The first part states that the matrix coefficients of G are dense in the space C(G) of 

2 

continuous complex- valued functions on G, and thus also in the space L (G) of square-integrable functions. The 

second part asserts the complete reducibility of unitary representations of G. The third part then asserts that the 

2 

regular representation of G on L (G) decomposes as the direct sum of all irreducible unitary representations. 

2 

Moreover, the matrix coefficients of the irreducible unitary representations form an orthonormal basis of L (G). 

Matrix coefficients 

A matrix coefficient of the group G is a complex- valued function 0 ; on G given as the composition 

if = L O 7T 

where jt : G — » GL(V) is a finite-dimensional (continuous) group representation of G, and L is a linear functional on 
the vector space of endomorphisms of V (e.g. trace), which contains GL(V) as an open subset. Matrix coefficients are 
continuous, since representations are by definition continuous, and linear functionals on finite-dimensional spaces 
are also continuous. 

The first part of the Peter— Weyl theorem asserts (Bump 2004, §4.1; Knapp 1986, Theorem 1.12): 

• The set of matrix coefficients of G is dense in the space of continuous complex functions C(G) on G, equipped 
with the uniform norm. 

This first result resembles the Stone-Weierstrass theorem in that it indicates the density of a set of functions in the 
space of all continuous functions, subject only to an algebraic characterization. In fact, if G is a matrix group, then 
the result follows easily from the Stone-Weierstrass theorem (Knapp 1986, p. 17). Conversely, it is a consequence of 
the subsequent conclusions of the theorem that any compact Lie group is isomorphic to a matrix group (Knapp 1986, 
Theorem 1.15). 

2 

A corollary of this result is that the matrix coefficients of G are dense in L (G). 

Decomposition of a unitary representation 

The second part of the theorem gives the existence of a decomposition of a unitary representation of G into 
finite-dimensional representations. Now, intuitively groups were conceived as rotations on geometric objects, so it is 
only natural to study representations which essentially arise from continuous actions on Hilbert spaces. (For those 
who were first introduced to dual groups consisting of characters which are the continuous homomorphisms into the 
circle group {zeC:|z|=l} , this approach is similar except that the circle group is (ultimately) generalised to the 
group of unitary operators on a given Hilbert space.) 
Let G be a topological group and H a complex Hilbert space. 

Given a continuous action *:GxH^H , it gives rise to a map p m -.G^ H H defined in the obvious way: p«(v)=gv . This 
map is clearly an homomorphism from G into GL(H), the homeomorphic automorphisms of H. And given such a 
map, we can uniquely recover the action in the obvious way. 

Thus we define the representations of G on an Hilbert space H to be those group homomorphisms, p, which arise 
from continuous actions of G on H. We say that a representation p is unitary if p(g) is a unitary operator for all 
g G G; i.e., {gv,gw)={v : w) for all v, w E H. (I.e. it is unitary if p.G^U(H). Notice how this generalises the special 
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case of the one-dimensional Hilbert space, where ^(C 1 ) is just the circle group.) 

Given these definitions, we can state the Peter-Weyl theorem. The second part of this theorem asserts (Knapp 1986, 
Theorem 1.14): 

• Let p be a unitary representation of a compact group G on a complex Hilbert space H. Then H splits into an 
orthogonal direct sum of irreducible finite-dimensional unitary representations of G. 



Decomposition of square-integrable functions 

To state the third and final part of the theorem, there is a natural Hilbert space over G consisting of square-integrable 

2 

functions, L (G); this makes sense because Haar measure exists on G. Calling this Hilbert space H, the group G has a 
unitary representation p on H by acting on the left, via 

p{h)f(g) = Hh-'g). 

The final statement of the Peter— Weyl theorem (Knapp 1986, Theorem 1.14) gives an explicit orthonormal basis of 

2 2 
L (G). Roughly it asserts that the matrix coefficients for G, suitably renormalized, are an orthonormal basis of L (G). 

2 

In particular, L (G) decomposes into an orthogonal direct sum of all the irreducible unitary representations, in which 
the multiplicity of each irreducible representation is equal to its degree (that is, the dimension of the underlying 
space of the representation). Thus, 

L\G) = 

where 2 denotes the set of (isomorphism classes of) irreducible unitary representations of G, and the summation 
denotes the closure of the direct sum of the total spaces E of the representations jt. 

More precisely, suppose that a representative jt is chosen for each isomorphism class of irreducible unitary 
representation, and denote the collection of all such jt by 2. Let u^be the matrix coefficients of jt in an 

orthonormal basis, in other words 
u if(9) = Mff)ei,ej). 

for each g E G. Finally, let cf 71 ^ be the degree of the representation jt. The theorem now asserts that the set of 
functions 



[Vd^uff | 7T G E, 1 < 3 < d {n) } 



2 

is an orthonormal basis of L (G). 



Consequences 

Structure of compact topological groups 

From the theorem, one can deduce a significant general structure theorem. Let G be a compact topological group, 

2 

which we assume Hausdorff. For any finite-dimensional G-invariant subspace V in L (G), where G acts on the left, 
we consider the image of G in GL(V). It is closed, since G is compact, and a subgroup of the Lie group GL(V). It 
follows by a basic theorem (of Elie Cartan) that the image of G is a Lie group also. 

If we now take the limit (in the sense of category theory) over all such spaces V, we get a result about G - because G 

2 

acts faithfully on L (G). We can say that G is an inverse limit of Lie groups. It may of course not itself be a Lie 
group: it may for example be a profinite group. 
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Supersymmetry 

In particle physics, supersymmetry (often abbreviated SUSY) is a symmetry that relates elementary particles of one 
spin to other particles that differ by half a unit of spin and are known as superpartners. In a theory with unbroken 
supersymmetry, for every type of boson there exists a corresponding type of fermion with the same mass and internal 
quantum numbers, and vice- versa. 

So far, there is only indirect evidence for the existence of supersymmetry Since the superpartners of the Standard 
Model particles have not been observed, supersymmetry, if it exists, must be a broken symmetry, allowing the 
superparticles to be heavier than the corresponding Standard Model particles. 

If supersymmetry exists close to the TeV energy scale, it allows for a solution of the hierarchy problem of the 
Standard Model, i.e., the fact that the Higgs boson mass is subject to quantum corrections which — barring 
extremely fine-tuned cancellations among independent contributions — would make it so large as to undermine the 
internal consistency of the theory. In supersymmetric theories, on the other hand, the contributions to the quantum 
corrections coming from Standard Model particles are naturally canceled by the contributions of the corresponding 
superpartners. Other attractive features of TeV-scale supersymmetry are the fact that it allows for the high-energy 
unification of the weak interactions, the strong interactions and electromagnetism, and the fact that it provides a 
candidate for Dark Matter and a natural mechanism for electro weak symmetry breaking. 

Another advantage of supersymmetry is that supersymmetric quantum field theory can sometimes be solved. 
Supersymmetry is also a feature of most versions of string theory, though it can exist in nature even if string theory 
is incorrect. 

The Minimal Supersymmetric Standard Model is one of the best studied candidates for physics beyond the Standard 
Model. Theories of gravity that are also invariant under supersymmetry are known as supergravity theories. 

History 

A supersymmetry relating mesons and baryons was first proposed, in the context of hadronic physics, by Hironari 
Miyazawa in 1966, but his work was ignored at the time.^ ^ ^ ^ In the early 1970s, J. L. Gervais and B. Sakita 
(in 1971), Yu. A. Golfand and E.P. Likhtman (also in 1971), D.V. Volkov and V.P. Akulov (in 1972) and J. Wess 
and B. Zumino (in 1974) independently rediscovered supersymmetry, a radically new type of symmetry of spacetime 
and fundamental fields, which establishes a relationship between elementary particles of different quantum nature, 
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bosons and fermions, and unifies spacetime and internal symmetries of the microscopic world. Supersymmetry first 
arose in the context of an early version of string theory by Pierre Ramond, John H. Schwarz and Andre Neveu, but 
the mathematical structure of supersymmetry has subsequently been applied successfully to other areas of physics; 
firstly by Wess, Zumino, and Abdus Salam and their fellow researchers to particle physics, and later to a variety of 
fields, ranging from quantum mechanics to statistical physics. It remains a vital part of many proposed theories of 
physics. 

The first realistic supersymmetric version of the Standard Model was proposed in 1981 by Howard Georgi and Savas 
Dimopoulos and is called the Minimal Supersymmetric Standard Model or MSSM for short. It was proposed to solve 
the hierarchy problem and predicts superpartners with masses between 100 GeV and 1 TeV. As of 2009 there is no 
irrefutable experimental evidence that supersymmetry is a symmetry of nature. In 2010 the Large Hadron Collider at 
CERN is scheduled to produce the world's highest energy collisions and offers the best chance at discovering 
superparticles for the foreseeable future. Recently prediction markets like intrade offered scientific contracts that 
give estimates for that probability. 

Applications 

Extension of possible symmetry groups 

One reason that physicists explored supersymmetry is because it offers an extension to the more familiar symmetries 
of quantum field theory. These symmetries are grouped into the Poincare group and internal symmetries and the 
Coleman-Mandula theorem showed that under certain assumptions, the symmetries of the S-matrix must be a direct 
product of the Poincare group with a compact internal symmetry group or if there is no mass gap, the conformal 
group with a compact internal symmetry group. In 1971 Golf and and Likhtman were the first to show that the 
Poincare algebra can be extended through introduction of four anticommuting spinor generators (in four 
dimensions), which later became known as supercharges. In 1975 the Haag-Lopuszanski-Sohnius theorem analyzed 
all possible superalgebras in the general form, including those with an extended number of the supergenerators and 
central charges. This extended super-Poincare algebra paved the way for obtaining a very large and important class 
of supersymmetric field theories. 

The supersymmetry algebra 

Traditional symmetries in physics are generated by objects that transform under the tensor representations of the 
Poincare group and internal symmetries. Supersymmetries, on the other hand, are generated by objects that transform 
under the spinor representations. According to the spin-statistics theorem, bosonic fields commute while fermionic 
fields anticommute. Combining the two kinds of fields into a single algebra requires the introduction of a Z 2 -grading 
under which the bosons are the even elements and the fermions are the odd elements. Such an algebra is called a Lie 
superalgebra. 

The simplest supersymmetric extension of the Poincare algebra is the Super-Poincare_algebra. Expressed in terms of 
two Weyl spinors, has the following anti-commutation relation: 

and all other anti-commutation relations between the Qs and commutation relations between the Qs and Ps vanish. In 
the above expression = —id^ are the generators of translation and are the Pauli matrices. 

There are representations of a Lie superalgebra that are analogous to representations of a Lie algebra. Each Lie 
algebra has an associated Lie group and a Lie superalgebra can sometimes be extended into representations of a Lie 
supergroup. 
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The Supersymmetric Standard Model 

Incorporating supersymmetry into the Standard Model requires doubling the number of particles since there is no 
way that any of the particles in the Standard Model can be superpartners of each other. With the addition of new 
particles, there are many possible new interactions. The simplest possible supersymmetric model consistent with the 
Standard Model is the Minimal Supersymmetric Standard Model (MSSM) which can include the necessary 
additional new particles that are able to be superpartners of those in the Standard Model. 

One of the main motivations for SUSY 
comes from the quadratically divergent 
contributions to the Higgs mass squared. 
The quantum mechanical interactions of the 
Higgs boson causes a large renormalization 
of the Higgs mass and unless there is an 
accidental cancellation, the natural size of 
the Higgs mass is the highest scale possible. 
This problem is known as the hierarchy 
problem. Supersymmetry reduces the size of 
the quantum corrections by having 
automatic cancellations between fermionic 
and bosonic Higgs interactions. If 
supersymmetry is restored at the weak scale, 
then the Higgs mass is related to 
supersymmetry breaking which can be 
induced from small non-perturbative effects 
explaining the vastly different scales in the 
weak interactions and gravitational interactions 
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Cancellation of the Higgs boson quadratic mass renormalization between fermionic 
top quark loop and scalar stop squark tadpole Feynman diagrams in a 
supersymmetric extension of the Standard Model 



In many supersymmetric Standard Models there is a heavy stable particle (such as neutralino) which could serve as a 
Weakly interacting massive particle (WIMP) dark matter candidate. The existence of a supersymmetric dark matter 
candidate is closely tied to R-parity. 

The standard paradigm for incorporating supersymmetry into a realistic theory is to have the underlying dynamics of 
the theory be supersymmetric, but the ground state of the theory does not respect the symmetry and supersymmetry 
is broken spontaneously. The supersymmetry break can not be done permanently by the particles of the MSSM as 
they currently appear. This means that there is a new sector of the theory that is responsible for the breaking. The 
only constraint on this new sector is that it must break supersymmetry permanently and must give superparticles TeV 
scale masses. There are many models that can do this and most of their details do not currently matter. In order to 
parameterize the relevant features of supersymmetry breaking, arbitrary soft SUSY breaking terms are added to the 
theory which temporarily break SUSY explicitly but could never arise from a complete theory of supersymmetry 
breaking. 



Gauge Coupling Unification 

One piece of evidence for supersymmetry existing is gauge coupling unification. The renormalization group 
evolution of the three gauge coupling constants of the Standard Model is somewhat sensitive to the present particle 
content of the theory. These coupling constants do not quite meet together at a common energy scale if we run the 
renormalization group using the Standard Model With the addition of minimal SUSY joint convergence of the 
coupling constants is projected at approximately 10 16 GeV.^ 
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Supersymmetric quantum mechanics 

Supersymmetric quantum mechanics adds the SUSY superalgebra to quantum mechanics as opposed to quantum 
field theory. Supersymmetric quantum mechanics often comes up when studying the dynamics of supersymmetric 
solitons and due to the simplified nature of having fields only functions of time (rather than space-time), a great deal 
of progress has been made in this subject and is now studied in its own right. 

SUSY quantum mechanics involves pairs of Hamiltonians which share a particular mathematical relationship, which 
are called partner Hamiltonians. (The potential energy terms which occur in the Hamiltonians are then called 
partner potentials.) An introductory theorem shows that for every eigenstate of one Hamiltonian, its partner 
Hamiltonian has a corresponding eigenstate with the same energy. This fact can be exploited to deduce many 
properties of the eigenstate spectrum. It is analogous to the original description of SUSY, which referred to bosons 
and fermions. We can imagine a "bosonic Hamiltonian", whose eigenstates are the various bosons of our theory. The 
SUSY partner of this Hamiltonian would be "fermionic", and its eigenstates would be the theory's fermions. Each 
boson would have a fermionic partner of equal energy. 

SUSY concepts have provided useful extensions to the WKB approximation. In addition, SUSY has been applied to 
non-quantum statistical mechanics through the Fokker-Planck equation. 

Mathematics 

SUSY is also sometimes studied mathematically for its intrinsic properties. This is because it describes complex 
fields satisfying a property known as holomorphy, which allows holomorphic quantities to be exactly computed. 
This makes supersymmetric models useful toy models of more realistic theories. A prime example of this has been 
the demonstration of S-duality in four-dimensional gauge theories that interchanges particles and monopoles. 

General supersymmetry 

Supersymmetry appears in many different contexts in theoretical physics that are closely related. It is possible to 
have multiple supersymmetries and also have supersymmetric extra dimensions. 

Extended supersymmetry 

It is possible to have more than one kind of supersymmetry transformation. Theories with more than one 
supersymmetry transformation are known as extended supersymmetric theories. The more supersymmetry a theory 
has, the more constrained the field content and interactions are. Typically the number of copies of a supersymmetry 
is a power of 2, i.e. 1, 2, 4, 8. In four dimensions, a spinor has four degrees of freedom and thus the minimal number 
of supersymmetry generators is four in four dimensions and having eight copies of supersymmetry means that there 
are 32 supersymmetry generators. 

The maximal number of supersymmetry generators possible is 32. Theories with more than 32 supersymmetry 
generators automatically have massless fields with spin greater than 2. It is not known how to make massless fields 
with spin greater than two interact, so the maximal number of supersymmetry generators considered is 32. This 
corresponds to an N= 8 supersymmetry theory. Theories with 32 supersymmetries automatically have a graviton. 

In four dimensions there are the following theories, with the corresponding multiplets ^ (CPT adds a copy, 
whenever they are not invariant under such symmetry) 

• N=l 

Chiral multiplet: (0, 1 / ) Vector multiplet: (V ,1) Gravitino multiplet: (l, 3 /) Graviton multiplet: ( 3 ,/,2) 

• N=2 

12 1 12 3 2 

hypermultiplet: (- /^,0 , vector multiplet: (0, ,1) supergravity multiplet: (1, ,2) 

• N=4 
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Vector multiplet: (-l,-/^ 4 ,0 6 , /^ 4 ,1) Supergravity multiplet: (0,V 2 4 ,l 6 , 3 / 2 4 ,2) 

Supergravity multiplet: (-2,- 3 / 2 8 ,-l 28 ,- 1 / 2 56 ,0 70 , 1 / 2 56 ,l 28 , 3 / 2 8 ,2) 
Supersymmetry in alternate numbers of dimensions 

It is possible to have supersymmetry in dimensions other than four. Because the properties of spinors change 
drastically between different dimensions, each dimension has its characteristic. In d dimensions, the size of spinors is 
roughly 2 dl2 or 2^ d ~ ^ /2 . Since the maximum number of supersymmetries is 32, the greatest number of dimensions in 
which a supersymmetric theory can exist is eleven. 

Supersymmetry as a quantum group 

Supersymmetry can be reinterpreted in the language of noncommutative geometry and quantum groups. In 
particular, it involves a mild form of noncommutativity, namely supercommutativity. See the main article for more 
details. 

Supersymmetry in quantum gravity 

Supersymmetry is part of a larger enterprise of theoretical physics to unify everything we know about the physical 
world into a single fundamental framework of physical laws, known as the quest for a Theory of Everything (TOE). 
A significant part of this larger enterprise is the quest for a theory of quantum gravity, which would unify the 
classical theory of general relativity and the Standard Model, which explains the other three basic forces in physics 
(electromagnetism, the strong interaction, and the weak interaction), and provides a palette of fundamental particles 
upon which all four forces act. Two of the most active approaches to forming a theory of quantum gravity are string 
theory and loop quantum gravity (LQG), although in theory, supersymmetry could be a component of other 
theoretical approaches as well. 

For string theory to be consistent, supersymmetry appears to be required at some level (although it may be a strongly 
broken symmetry). In particle theory, supersymmetry is recognized as a way to stabilize the hierarchy between the 
unification scale and the electro weak scale (or the Higgs boson mass), and can also provide a natural dark matter 
candidate. String theory also requires extra spatial dimensions which have to be compactified as in Kaluza-Klein 
theory. 

Loop quantum gravity (LQG), in its current formulation, predicts no additional spatial dimensions, nor anything else 
about particle physics. These theories can be formulated in three spatial dimensions and one dimension of time, 
although in some LQG theories dimensionality is an emergent property of the theory, rather than a fundamental 
assumption of the theory. Also, LQG is a theory of quantum gravity which does not require supersymmetry. Lee 
Smolin, one of the originators of LQG, has proposed that a loop quantum gravity theory incorporating either 
supersymmetry or extra dimensions, or both, be called "loop quantum gravity II". 

If experimental evidence confirms supersymmetry in the form of supersymmetric particles such as the neutralino that 
is often believed to be the lightest superpartner, some people believe this would be a major boost to string theory. 
Since supersymmetry is a required component of string theory, any discovered supersymmetry would be consistent 
with string theory. If the Large Hadron Collider and other major particle physics experiments fail to detect 
supersymmetric partners or evidence of extra dimensions, many versions of string theory which had predicted certain 
low mass superpartners to existing particles may need to be significantly revised. The failure of experiments to 
discover either supersymmetric partners or extra spatial dimensions, as of 2009, has encouraged loop quantum 
gravity researchers. 
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Current Limits 

The tightest limits will of course come from direct production at colliders. Both the Large Electron-Positron Collider 
and Tevatron have set limits for specific models which have not yet been exceeded by the Large Hadron Collider. 
Searches are only applicable for a finite set of tested points because simulation using the Monte Carlo method must 
be made so that limits for that particular model can be calculated. This complicates matters because different 
experiments have looked at different sets of points. Some extrapolation between points can be made within particular 
models but it is difficult to set general limits even for the Minimal Super symmetric Standard Model. 

The first mass limits for squarks and gluinos were made at CERN by the UA1 experiment and the UA2 experiment 
at the Super Proton Synchrotron. LEP later set very strong limits which are still relevant today . Most recently 
these limits were extended by the DO experiment ^ ^ 

See also 

• Concise Encyclopedia of Supersymmetry (book) 

• Minimal Supersymmetric Standard Model 

• Quantum group 

• Supercharge 

• Supergeometry 

• Supergravity 

• Supergroup 

• Superspace 
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Invariance mechanics 



In physics, invariance mechanics, in its simplest form, is the rewriting of the laws of quantum field theory in terms 
of invariant quantities only. For example, the positions of a set of particles in a particular coordinate system is not 
invariant under translations of the system. However, the (4-dimensional) distances between the particles is invariant 
under translations, rotations and Lorentz transformations of the system. 

The invariant quantities made from the input and output states of a system are the only quantities needed to give a 
probability amplitude to a given system. This is what is meant by the system obeying a symmetry. Since all the 
quantities involved are relative quantities, invariance mechanics, can be thought of as taking relativity theory to its 
natural limit. 

Invariance mechanics has strong links with loop quantum gravity in which the invariant quantities are based on 
angular momentum. In invariance mechanics, space and time come secondary to the invariants and are seen as useful 
concepts that emerge only in the large scale limit. 

Feynman rules 

The Feynman rules of a quantum system can be rewritten in terms of invariant quantities (plus constants such as 
mass, charge, etc.) The invariant quantities depend on the type of particle, scalar, vector or spinor. The rules often 
involve geometric quantities such as the volumes of simplices formed from vertices of the Feynman graphs. 

Scalar particles 

In a system of scalar particles, the only invariant quantities are the 4-dimensional distances (intervals) between the 
starting points ( x ) and ending points ( x ! ) °f me particle paths. These points form a complete graph: 



The invariants are the numbers 
R — \x — x\. 

Vector particles 

In a system of vector particles such as photons, the invariants are the 4-dimensional distances between the starting 
points and ending points of the particle paths, and the angles between the starting and ending polarisation vectors of 
the photons { Pn) 

There are four invariants on each line: 

R — \x — x\ 

s = PvP'v 
T=(x-x%p' fi 

U = pp{x - x% 
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Yang-Mills vector particles 

Yang-Mills vector fields of a given gauge group also involve the angle representing a rotation of the gauge group ( 

There are three invariants on each line: 
R = \x-x'\ 

s = tW> 

T = ptfx - x%(* - x>) T p». 
Spinor fields 

These involve the angles between the spinor vectors. The invariants are: 
R = |x-x'|, 

s = ^ 75 "V, 

T = u a if{x - x')jf. 
So for example, the fermion propagator is defined in relation to the massless scalar propagator as 

G+(R, S,T,m)=(jL + ?-JL- mS^j G F (R, m). 
Mixed systems 

Systems usually consist of a mixture of scalar, spinor and vector fields and the invariants can depend on angles 
between spinors and vectors. To simplify this process ideas from Twistor theory are often used which enables one to 
decompose a null- vector into a pair of spinors. Alternatively 3 -point invariants can be introduced such as the 
spinor -spinor -vector triangle invariant: 

V = u-jfp^. 

It is important to note that some types of invariants are combinations of other types invariants, for example the 
angles in a complete-graph are invariants but they can be found as combinations of distance invariants. 

In chromodynamics, for example, there are 4-point invariants also. So for a completely specified system you would 
have several numbers assigned to each line, triangle and tetrahedron in a complete graph representing the system. 

One outstanding problem is that of enumerating all the possible invariants which can be made from the various spin 
and polarisation vectors. 

Constraints 

A system represented by a complete graph contains many invariant quantities. For large graphs, however, not all 
these quantities are independent and we must specify dimensional and gauge constraints. Why the particular number 
of dimensions or particular gauge group is chosen is still not known. The constraints and whether they are satisfied 
exactly or approximately is the key to invariance mechanics and the difference between it and conventional field 
theory. Work is being done to see whether the breaking of these constraints is a consequence of the gravitational 
field. If the constraints are satisfied only approximately, i.e. if there is a quantum uncertainty in the constraints then 
they are best thought of as local maxima of the amplitudes of a system which occur due to the specific Feynman 
rules used. 
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Dimensions 

Since invariance mechanics does not explicitly use coordinate systems, the definition of dimension is slightly 
different. The equivalent way of expressing the number of dimensions is given, as in distance geometry, as 
specifying that the volume of any (D + 2)-simplex made from the points in the system is zero. The volume of a 
simplex is given by a formula involving the invariant distances (the R's) between the points which is given by the 
Cayley-Menger determinants. If this determinant is exactly 0 for all simplices then the geometry is Euclidean. If the 
determinant is only approximately 0 then at small distances space-time is non-Euclidean. This has deep connections 
with quantum foam and loop quantum gravity. 

For Minkowski space, or for any space with signature (+ + + ... + -) this makes no difference to the formulae for 
invariance mechanics. 

Gravity 

By allowing quantum uncertainty in the dimensional constraints (which involves replacing delta functions with 
reciprocal functions in the equations), the geometry is no longer confined to flat space-time, this break from flat 
space-time can be seen as a curvature and, just as in General Relativity can be seen as the cause of gravity. This is 
called 'off-dimension' physics in analogy to off-shell physics. 

Gauge group 

In a similar way to expressing the number of dimensions, the dimension and type of the gauge group is given by an 
identity involving the polarisation (or spin) invariants (the S, T and U's). In the simple cases such as for the photon, 
these are simply spherical versions of the Cayley-Menger determinants. The gauge group is an internal symmetry 
because the gauge identity involves far more quantities than the dimensional identity. A simple gauge group such as 
SU(5) or .E 6 involves less invariants than a non-simple gauge group such as U(l)xSU(2)xSU(3) (see: Standard 
Model). There has been recent work on combining the dimensional and gauge constraints into a single equation to 
produce a unified theory. It is thought that this will be achieved by combining of the invariants on each line into a 
single complex number (or hypercomplex number). 

Supersymmetry 

In the supersymmetric model, some of the spinor invariants and vector invariants are combined together into a single 
invariant. Having less invariants means that there is more symmetry and more transformations are possible such as 
transformations between fermions and bosons. It is believed, although currently unproved, that the minimum number 
of invariants on each line of a complete graph representing a system is two - those being the 4-dimesional distances 
(the R's) and an angle representing the rotation from one particle 'flavour' to another particle 'flavour' (the T's). Some 
have suggested that even these invariants can be combined into one by saying that the 4 dimensions of space and 
time are just 4 more flavours that a particle can have, albeit ones which can change very little (compared with the 
size of the Universe as a whole). Models of this type imply that the universe has an overall spherical geometry. The 
mixing of space-time and flavour symmetries adds an additional degree of freedom to a particle's light-cone which 
appears as a unique mass for each particle depending on the flavour. 

Having a small number of invariants doesn't necessarily make a simpler model since all the complexity of the model 
is bound up in the constraints which can be polynomials of hundreds of variables. One of the primary aims of 
invariance mechanics is to find these polynomial(s) and to find which symmetry group they correspond to. Many 
believe that the permutation of the variables of these polynomial(s) will correspond to one of the special sporadic 
groups. (Interestingly, only the largest sporadic group, the monster group is big enough to incorporate the constraints 
for the Standard Model). The other main aim is to find appropriate Feynman rules on the invariants which both 
accurately describe nature and don't lead to infinities. 
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M-theory 

Although invariance mechanics was born out of trying to understand point particle theory, possible connections with 
superstring and M-theory have emerged. The argument is that the smallest simplex which needs a constraint to be 
4-dimensional is the 6-simplex. This can be viewed as the endpoints of a 3-simplex (a triangular membrane) moving 
through time. The propagator function of this would be 1/Vq which is the inverse of the volume of a 6-simplex. In 
other words the greatest probability would be when the volume of this 6-simplex is 0 and hence it is embedded in 4 
dimensions. Hence the propagator for a particle would the same as the dimensional constraint. So if the Universe is 
built out of 6 simplices then the dimensional constraint can be applied to all simplices. Other fields of work are 
investigating whether the distance invariants may take only discrete values and whether areas or volumes should be 
taken as the fundamental invariants. (The dual of loop quantum gravity involves quantisized areas). 
Others take the view that in invariance mechanics it should be irrelevant whether you view the fundamental 
constituents as particles or strings or membranes and a more formal approach is called for. 

History 

The history of invariance mechanics is difficult to pinpoint since many people have been working on it without 
realising that they were working on invariance mechanics. Notable milestones include the 4-dimensional invariant 
found by Einstein in special relativity (1905), Yang-Mills gauge invariants theory. Roger Penrose and his 
spin-networks (1960's) influenced the subject. Cayley-Menger and their invariant based metric theory was an 
important milestone. Recently Baratin-Freidel (2006) have demonstrated the connection between invariance 
mechanics and loop quantum gravity. 



External links 

• Introduction to Invariance Mechanics ^ 

• Renormalization of Crumpled Manifolds 

• Hidden Quantum Gravity in 4d Feynman Diagrams 
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Wigner's theorem 

Wigner's theorem is a cornerstone of the mathematical formulation of quantum mechanics. It was proved by 
Eugene Wigner in 1931^ . The theorem specifies how physical symmetries such as rotations, translations, CPT, etc. 
act on the Hilbert space of states. According to the theorem any symmetry acts as a unitary or anti-unitary 
transformation in the Hilbert space. 

More precisely, it states that a surjective map T : H — > H on a complex Hilbert space H , which satisfies 
\(Tz,Ty)\ = \{x,y)\ 

for all x,y £ H , has the form Tx — (f(x)Uxfor all x (E H , where [p : H — > C has modulus one and 
JJ : H — > H is either unitary or antiunitary. 
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Wigner's classification 

In mathematics and theoretical physics, Wigner's classification is a classification of the nonnegative energy 
irreducible unitary representations of the Poincare group, which have sharp mass eigenvalues. It was proposed by 
Eugene Wigner, for reasons coming from physics — see the article particle physics and representation theory. 

The mass m = y^p^is a Casimir invariant of the Poincare group. So, we can classify the representations 
according to whether m > 0, m = Obut _P 0 > 0 an d m = Oand P = 0 • 

For the first case, we note that the eigenspace (see generalized eigenspaces of unbounded operators) associated with 
Pq — m and P i = Ois a representation of SO(3). In the ray interpretation, we can go over to Spin(3) instead. So, 
massive states are classified by an irreducible Spin(3) unitary and a positive mass, m . 

For the second case, we look at the stabilizer ofP 0 = fc, P 3 = — fc,i^ = 0, i = l,2. This is the double 
cover of SE(2) (see unit ray representation). We have two cases, one where irreps are described by an integral 
multiple of 1/2, called the helicity and the other called the "continuous spin" representation. 

The last case describes the vacuum. The only finite dimensional unitary solution is the trivial representation called 
the vacuum. 

The double cover of the Poincare group admits no central extensions. 

Note: This classification leaves out tachyonic solutions, solutions with no fixed mass, infraparticles with no fixed 
mass, etc. 
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Quantum hydrodynamics 

Quantum hydrodynamics (QHD) is most generally the study of hydrodynamic systems which demonstrate 
behavior implicit in quantum subsystems (usually quantum tunneling). They arise in semiclassical mechanics in the 
study of semiconductor devices, in which case being derived from the Wigner-Boltzmann equation. In quantum 
chemistry they arise as solutions to chemical kinetic systems, in which case they are derived from the Schrodinger 
equation by way of Madelung equations. 

An important system of study in quantum hydrodynamics is that of superfluidity. Some other topics of interest in 
quantum hydrodynamics are quantum turbulence, quantized vortices, first, second and third sound, and quantum 
solvents. The quantum hydrodynamic equation is an equation in Bohmian mechanics, which, it turns out, has a 
mathematical relationship to classical fluid dynamics (see Madelung equations). This is a rich theoretical field. 

Some common experimental applications of these studies are in liquid helium (He-3 and He-4), and of the interior of 
neutron stars and the quark-gluon plasma. Many famous scientists have worked in quantum hydrodynamics, 
including Richard Feynman, Lev Landau, and Pyotr L. Kapitsa. 
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Quantum magnetodynamics 

Quantum electrodynamics (QED) is the relativistic quantum field theory of electrodynamics. In essence, it 
describes how light and matter interact and is the first theory where full agreement between quantum mechanics and 
special relativity is achieved. QED mathematically describes all phenomena involving electrically charged particles 
interacting by means of exchange of photons and represents the quantum counterpart of classical electrodynamics 
giving a complete account of matter and light interaction. One of the founding fathers of QED, Richard Feynman, 
has called it "the jewel of physics" for its extremely accurate predictions of quantities like the anomalous magnetic 
moment of the electron, and the Lamb shift of the energy levels of hydrogen 

In technical terms, QED can be described as a perturbation theory of the electromagnetic quantum vacuum. 




Paul Dirac 



History 

The first formulation of a quantum theory describing radiation and matter interaction is due to Paul Adrien Maurice 
Dirac, who, during 1920, was first able to compute the coefficient of spontaneous emission of an atom. 1 

Dirac described the quantization of the electromagnetic field as an ensemble of harmonic 
oscillators with the introduction of the concept of creation and annihilation operators of 
particles. In the following years, with contributions from Wolfgang Pauli, Eugene 
Wigner, Pascual Jordan, Werner Heisenberg and an elegant formulation of quantum 
electrodynamics due to Enrico Fermi, physicists came to believe that, in principle, it 
would be possible to perform any computation for any physical process involving 
photons and charged particles. However, further studies by Felix Bloch with Arnold 
Nordsieck,^ and Victor Weisskopf,^ in 1937 and 1939, revealed that such 
computations were reliable only at a first order of perturbation theory, a problem already 
pointed out by Robert Oppenheimer.^ At higher orders in the series infinities emerged, 
making such computations meaningless and casting serious doubts on the internal consistency of the theory itself. 
With no solution for this problem known at the time, it appeared that a fundamental incompatibility existed between 
special relativity and quantum mechanics . 

Difficulties with the theory increased through the end of 1940. Improvements in 
microwave technology made it possible to take more precise measurements of the shift 
of the levels of a hydrogen atom, now known as the Lamb shift and magnetic moment 
of the electron. ^ These experiments unequivocally exposed discrepancies which the 
theory was unable to explain. 

A first indication of a possible way out was given by Hans Bethe. In 1947, while he was 

rm 

traveling by train to reach Schenectady from New York, after giving a talk at the 
conference at Shelter Island on the subject, Bethe completed the first non-relativistic 
computation of the shift of the lines of the hydrogen atom as measured by Lamb and 
RetherfordJ 10 ^ Despite the limitations of the computation, agreement was excellent. The 
idea was simply to attach infinities to corrections at mass and charge that were actually 
fixed to a finite value by experiments. In this way, the infinities get absorbed in those constants and yield a finite 
result in good agreement with experiments. This procedure was named renormalization. 




Hans Bethe 
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Based on Bethe's intuition and fundamental 
papers on the subject by Sin-Itiro 
TomonagaJ 11 ^ Julian Schwinger/ 12 ^ ^ 
Richard Feynman^ 14 ^ ^ 15 ^ ^ and Freeman 
Dyson^ ^ , it was finally possible to get 
fully covariant formulations that were finite 
at any order in a perturbation series of 
quantum electrodynamics. Sin-Itiro 
Tomonaga, Julian Schwinger and Richard 
Feynman were jointly awarded with a Nobel 
prize in physics in 1965 for their work in 
this areaJ 19 ^ Their contributions, and those 
of Freeman Dyson, were about covariant 
and gauge invariant formulations of 
quantum electrodynamics that allow 
computations of observables at any order of 
perturbation theory. Feynman' s 

mathematical technique, based on his 
diagrams, initially seemed very different 
from the field- theoretic, operator-based 
approach of Schwinger and Tomonaga, but 
Freeman Dyson later showed that the two 
approaches were equivalents 17 ^ 

Renormalization, the need to attach a 
physical meaning at certain divergences 
appearing in the theory through integrals, 
has subsequently become one of the 
fundamental aspects of quantum field theory 

and has come to be seen as a criterion for a theory's general acceptability. Even though renormalization works very 
well in practice, Feynman was never entirely comfortable with its mathematical validity, even referring to 



Shelter Island Conference group photo (Courtesy of Archives, National Academy 

of Sciences). 




Feynman (center) and Oppenheimer (left) 
at Los Alamos. 



renormalization as a "shell game" and "hocus pocus". 



, [20] 



QED has served as the model and template for all subsequent quantum field theories. One such subsequent theory is 
quantum chromodynamics, which began in the early 1960s and attained its present form in the 1975 work by H. 
David Politzer, Sidney Coleman, David Gross and Frank Wilczek. Building on the pioneering work of Schwinger, 
Gerald Guralnik, Dick Hagen, and Tom Kibble, Peter Higgs, Jeffrey Goldstone, and others, Sheldon 

Glashow, Steven Weinberg and Abdus Salam independently showed how the weak nuclear force and quantum 
electrodynamics could be merged into a single electroweak force. 
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Feynman's view of quantum electrodynamics 



Introduction 

Near the end of his life, Richard P. Feynman gave a series of lectures on QED intended for the lay public. These 
lectures were transcribed and published as Feynman (1985), QED: The strange theory of light and matter} 1 ^ ^ a 
classic non-mathematical exposition of QED from the point of view articulated below. 

The key components of Feynman's presentation of QED are three basic actions. 

• A photon goes from one place and time to another place and time. 

• An electron goes from one place and time to another place and time. 

• An electron emits or absorbs a photon at a certain place and time. 

These actions are represented in a form of 
visual shorthand by the three basic elements 
of Feynman diagrams: a wavy line for the 
photon, a straight line for the electron and a 
junction of two straight lines and a wavy 
one for a vertex representing emission or 
absorption of a photon by an electron. These 
may all be seen in the adjacent diagram. 



Time 



B 



D 




A 

PHOTON 



ELECTRON 



PHOTON EMISSION OR ABSORPTION 



Space 



Feynman diagram elements 



It is important not to over-interpret these 
diagrams. Nothing is implied about how a 
particle gets from one point to another. The 
diagrams do not imply that the particles are 
moving in straight or curved lines. They do 
not imply that the particles are moving with 
fixed speeds. The fact that the photon is often represented, by convention, by a wavy line and not a straight one does 
not imply that it is thought that it is more wavelike than is an electron. The images are just symbols to represent the 
actions above: photons and electrons do, somehow, move from point to point and electrons, somehow, emit and 
absorb photons. We do not know how these things happen, but the theory tells us about the probabilities of these 
things happening. Trajectory is a meaningless concept in quantum mechanics. 

As well as the visual shorthand for the actions Feynman introduces another kind of shorthand for the numerical 
quantities which tell us about the probabilities. If a photon moves from one place and time - in shorthand, A - to 
another place and time — shorthand, B - the associated quantity is written in Feynman's shorthand as P(A to B). The 
similar quantity for an electron moving from C to D is written E(C to D). The quantity which tells us about the 
probability for the emission or absorption of a photon he calls 'j'. This is related to, but not the same as, the measured 
electron charge 'e'. 

QED is based on the assumption that complex interactions of many electrons and photons can be represented by 
fitting together a suitable collection of the above three building blocks, and then using the probability-quantities to 
calculate the probability of any such complex interaction. It turns out that the basic idea of QED can be 
communicated while making the assumption that the quantities mentioned above are just our everyday probabilities. 
(A simplification of Feynman's book.) Later on this will be corrected to include specifically quantum mathematics, 
following Feynman. 

The basic rules of probabilities that will be used are that a) if an event can happen in a variety of different ways then 
its probability is the sum of the probabilities of the possible ways and b) if a process involves a number of 
independent subprocesses then its probability is the product of the component probabilities. 
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Basic constructions 




Suppose we start with one electron at a certain place and time (this place and time being given the arbitrary label A) 
and a photon at another place and time (given the label B). A typical question from a physical standpoint is: 'What is 
the probability of finding an electron at C (another place and a later time) and a photon at D (yet another place and 
time)?'. The simplest process to achieve this end is for the electron to move from A to C (an elementary action) and 
that the photon moves from B to D (another elementary action). From a knowledge of the probabilities of each of 
these subprocesses - E(A to C) and P(B to D) - then we would expect to calculate the probability of both happening 
by multiplying them, using rule b) above. This gives a simple estimated answer to our question. 

But there are other ways in which the end result could come about. 
The electron might move to a place and time E where it absorbs 
the photon; then move on before emitting another photon at F; 
then move on to C where it is detected, while the new photon 
moves on to D. The probability of this complex process can again 
be calculated by knowing the probabilities of each of the 
individual actions: three electron actions, two photon actions and 
two vertexes - one emission and one absorption. We would expect 
to find the total probability by multiplying the probabilities of each 
of the actions, for any chosen positions of E and F. We then, using 
rule a) above, have to add up all these probabilities for all the 
alternatives for E and F. (This is not elementary in practice, and 
involves integration.) But there is another possibility: that is that the electron first moves to G where it emits a 
photon which goes on to D, while the electron moves on to H, where it absorbs the first photon, before moving on to 
C. Again we can calculate the probability of these possibilities (for all points G and H). We then have a better 
estimation for the total probability by adding the probabilities of these two possibilities to our original simple 
estimate. Incidentally the name given to this process of a photon interacting with an electron in this way is Compton 
Scattering. 



Compton scattering 



There are an infinite number of other intermediate processes in which more and more photons are absorbed and/or 
emitted. For each of these possibilities there is a Feynman diagram describing it. This implies a complex 
computation for the resulting probabilities, but provided it is the case that the more complicated the diagram the less 
it contributes to the result, it is only a matter of time and effort to find as accurate an answer as one wants to the 
original question. This is the basic approach of QED. To calculate the probability of any interactive process between 
electrons and photons it is a matter of first noting, with Feynman diagrams, all the possible ways in which the 
process can be constructed from the three basic elements. Each diagram involves some calculation involving definite 
rules to find the associated probability. 

That basic scaffolding remains when one moves to a quantum description but some conceptual changes are 
requested. One is that whereas we might expect in our everyday life that there would be some constraints on the 
points to which a particle can move, that is not true in full quantum electrodynamics. There is a certain possibility of 
an electron or photon at A moving as a basic action to any other place and time in the universe. That includes places 
that could only be reached at speeds greater than that of light and also earlier times. (An electron moving backwards 
in time can be viewed as a positron moving forward in time.) 
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Probability amplitudes 




Addition of probability amplitudes as complex 
numbers 



Quantum mechanics introduces an important change on the way 
probabilities are computed. It has been found that the quantities 
which we have to use to represent the probabilities are not the 
usual real numbers we use for probabilities in our everyday world, 
but complex numbers which are called probability amplitudes. 
Feynman avoids exposing the reader to the mathematics of 
complex numbers by using a simple but accurate representation of 
them as arrows on a piece of paper or screen. (These must not be 
confused with the arrows of Feynman diagrams which are actually 
simplified representations in two dimensions of a relationship 
between points in three dimensions of space and one of time.) The 

amplitude-arrows are fundamental to the description of the world given by quantum theory. No satisfactory reason 
has been given for why they are needed. But pragmatically we have to accept that they are an essential part of our 
description of all quantum phenomena. They are related to our everyday ideas of probability by the simple rule that 
the probability of an event is the square of the length of the corresponding amplitude-arrow. So, for a given process, 
if two probability amplitudes, v and w, are involved, the probability of the process will be given either by 

P = |v + w| 2 

or 

p = | v x w| 2 - 

The rules as regards adding or multiplying, however, are the same as above. But where you would expect to add or 
multiply probabilities, instead you add or multiply probability amplitudes that now are complex numbers. 

Addition and multiplication are familiar operations in the theory of 
complex numbers and are given in the figures. The sum is found as 
follows. Let the start of the second arrow be at the end of the first. 
The sum is then a third arrow that goes directly from the start of 
the first to the end of the second. The product of two arrows is an 
arrow whose length is the product of the two lengths. The 
direction of the product is found by adding the angles that each of 
the two have been turned through relative to a reference direction: 
that gives the angle that the product is turned relative to the 
reference direction. 

That change, from probabilities to probability amplitudes, 
complicates the mathematics without changing the basic approach. But that change is still not quite enough because 
it fails to take into account the fact that both photons and electrons can be polarized, which is to say that their 
orientation in space and time have to be taken into account. Therefore P(A to B) actually consists of 16 complex 
numbers, or probability amplitude arrows. There are also some minor changes to do with the quantity "j", which may 
have to be rotated by a multiple of 90° for some polarizations, which is only of interest for the detailed bookkeeping. 

Associated with the fact that the electron can be polarized is another small necessary detail which is connected with 
the fact that an electron is a Fermion and obeys Fermi-Dirac statistics. The basic rule is that if we have the 
probability amplitude for a given complex process involving more than one electron, then when we include (as we 
always must) the complementary Feynman diagram in which we just exchange two electron events, the resulting 
amplitude is the reverse — the negative — of the first. The simplest case would be two electrons starting at A and B 
ending at C and D. The amplitude would be calculated as the "difference", E(A to B)xE(C to D) — E(A to C)xE(B to 
D), where we would expect, from our everyday idea of probabilities, that it would be a sum. 
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Propagators 

Finally, one has to compute P(A to B) and E (C to D) corresponding to the probability amplitudes for the photon and 
the electron respectively. These are essentially the solutions of the Dirac Equation which describes the behavior of 
the electron's probability amplitude and the Klein-Gordon equation which describes the behavior of the photon's 
probability amplitude. These are called Feynman propagators. The translation to a notation commonly used in the 
standard literature is as follows: 

P(A to B) -> D F (x B - x A ), E(C to D) -> S F (x D - x c ) 
where a shorthand symbol such as Xj± stands for the four real numbers which give the time and position in three 
dimensions of the point labeled A. 



Mass renormalization 



A problem arose historically which held up progress for twenty 
years: although we start with the assumption of three basic 
"simple" actions, the rules of the game say that if we want to 
calculate the probability amplitude for an electron to get from A to 
B we must take into account all the possible ways: all possible 
Feynman diagrams with those end points. Thus there will be a way 
in which the electron travels to C, emits a photon there and then 
absorbs it again at D before moving on to B. Or it could do this 
kind of thing twice, or more. In short we have a fractal-like 
situation in which if we look closely at a line it breaks up into a 
collection of "simple" lines, each of which, if looked at closely, are 
in turn composed of "simple" lines, and so on ad infinitum. This is 
a very difficult situation to handle. If adding that detail only 
altered things slightly then it would not have been too bad, but 
disaster struck when it was found that the simple correction mentioned above led to infinite probability amplitudes. 
In time this problem was "fixed" by the technique of renormalization (see below and the article on mass 
renormalization). However, Feynman himself remained unhappy about it, calling it a "dippy process" P 0 ^ 




Electron self-energy loop 



Conclusions 

Within the above framework physicists were then able to calculate to a high degree of accuracy some of the 
properties of electrons, such as the anomalous magnetic dipole moment. However, as Feynman points out, it fails 
totally to explain why particles such as the electron have the masses they do. "There is no theory that adequately 
explains these numbers. We use the numbers in all our theories, but we don't understand them — what they are, or 
where they come from. I believe that from a fundamental point of view, this is a very interesting and serious 
problem. "^ 



Quantum magnetodynamics 



413 



Mathematics 

Mathematically, QED is an abelian gauge theory with the symmetry group U(l). The gauge field, which mediates 
the interaction between the charged spin- 1/2 fields, is the electromagnetic field. The QED Lagrangian for a spin- 1/2 
field interacting with the electromagnetic field is given by the real part of 

c = v^7 m £>m - - ^ V . 

where 

7^ are Dirac matrices; 

ij; a bispinor field of spin- 1/2 particles (e.g. electron-positron field); 

= i/^7o> called "psi-bar", is sometimes referred to as Dirac adjoint; 
Dp — dp + icAp + ieB^is the gauge covariant derivative; 
e is the coupling constant, equal to the electric charge of the bispinor field; 

is the covariant four-potential of the electromagnetic field generated by the electron itself; 
Bp is the external field imposed by external source; 

— d^Av — d^A^is the electromagnetic field tensor. 

Equations of motion 

To begin, substituting the definition of D into the Lagrangian gives us: 

£ = ijrfW ~ ehM* + - m W> ~ iff***"- 

Next, we can substitute this Lagrangian into the Euler-Lagrange equation of motion for a field: 



to find the field equations for QED. 

The two terms from this Lagrangian are then: 

— = -efa^A* + Bfl ) - 

Substituting these two back into the Euler-Lagrange equation (2) results in: 

iByfrf + e^(A^ + B^)+m^ = 0 
with complex conjugate: 

i-fdyft - ej^A* 1 + B^ip - rmj> = 0. 
Bringing the middle term to the right-hand side transforms this second equation into: 



The left-hand side is like the original Dirac equation and the right-hand side is the interaction with the 
electromagnetic field. 

One further important equation can be found by substituting the Lagrangian into another Euler-Lagrange equation, 
this time for the field, J^ 1 : 
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The two terms this time are: 

dC 

— = -e+f* 
and these two terms, when substituted back into (3) give us: 



Interaction picture 

This theory can be straightforwardly quantized treating bosonic and fermionic sectors as free. This permits to build a 
set of asymptotic states to start a computation of the probability amplitudes for different processes. In order to be 
able to do so, we have to compute an evolution operator that, for a given initial state, will give a final state in such a 
way to have 

M fi = (f\U\i). 

This technique is also known as the S-Matrix. Evolution operator is obtained in the interaction picture where time 
evolution is given by the interaction Hamiltonian. So, from equations above is 

V = e J Sx^^A^ 

and so, one has 

U = Texp I--*- f dt f V{t') 
L h J to 

being T the time ordering operator. This evolution operator has only a meaning as a series and what we get here is a 
perturbation series with a development parameter being fine structure constant. This series is named Dyson series. 

Feynman diagrams 

Despite the conceptual clarity of this Feynman approach to QED, almost no textbooks follow him in their 
presentation. When performing calculations it is much easier to work with the Fourier transforms of the propagators. 
Quantum physics considers particle's momenta rather than their positions, and it is convenient to think of particles as 
being created or annihilated when they interact. Feynman diagrams then look the same, but the lines have different 
interpretations. The electron line represents an electron with a given energy and momentum, with a similar 
interpretation of the photon line. A vertex diagram represents the annihilation of one electron and the creation of 
another together with the absorption or creation of a photon, each having specified energies and momenta. 

Using Wick theorem on the terms of the Dyson series, all the terms of the S-matrix for quantum electrodynamics can 
be computed through the technique of Feynman diagrams. In this case rules for drawing are the following 
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a fr- /3 




P 2 + 2£ 



-ze7^(27r) 4 ^(p 1+ p 2 +p3)^ 



Incoming antifermion: a • -» s) 

Outgoing fermion: • ^ a -> u a (p,s) 

Outgoing antifermion: • a -> '^a(p>s) 

Incoming photon: ^ *wx^v« -> 6^ (ft, A) 

Outgoing photon: r\-^\-^ ft e^(fc, A)* 

To these rules we must add a further one for closed loops that implies an integration on momenta J cftp / (2tt)^ • 

From them, computations of probability amplitudes are straightforwardly given. An example is Compton scattering, 
with an electron and a photon undergoing elastic scattering. Feynman diagrams are in this case 
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and so we are able to get the corresponding amplitude at the first order of a perturbation series for S -matrix: 
M fi = (ie) 2 u(p',sW(k'A') 



( p + fc )2_ m 2 

from which we are able to compute the cross section for this scattering. 



(p — k f ) 2 — m\ 



Renormalizability 

Higher order terms can be straightforwardly computed for the evolution operator but these terms display diagrams 
containing the following simpler ones 




One-loop contribution to the 
vacuum polarization function 

n 



One-loop contribution to the electron 
self-energy function Yj 




One-loop 
contribution 
to the vertex 
function P 



that, being closed loops, imply the presence of diverging integrals having no mathematical meaning. To overcome 
this difficulty, a technique like renormalization has been devised, producing finite results in very close agreement 
with experiments. It is important to note that a criterion for theory being meaningful after renormalization is that the 
number of diverging diagrams is finite. In this case the theory is said renormalizable. The reason for this is that to 
get observables renormalized one needs a finite number of constants to maintain the predictive value of the theory 
untouched. This is exactly the case of quantum electrodynamics displaying just three diverging diagrams. This 
procedure gives observables in very close agreement with experiment as seen e.g. for electron gyromagnetic ratio. 

Renormalizability has become an essential criterion for a quantum field theory to be considered as a viable one. All 
the theories describing fundamental interactions, except gravitation whose quantum counterpart is presently under 
very active research, are renormalizable theories. 
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Nonconvergence of series 

An argument by Freeman Dyson shows that the radius of convergence of the perturbation series in QED is zeroJ 24 ^ 
The basic argument goes as follows: if the coupling constant were negative, this would be equivalent to the Coulomb 
force constant being negative. This would "reverse" the electromagnetic interaction so that like charges would attract 
and unlike charges would repel. This would render the vacuum unstable against decay into a cluster of electrons on 
one side of the universe and a cluster of positrons on the other side of the universe. Because the theory is sick for any 
negative value of the coupling constant, the series do not converge, but are an asymptotic series. This can be taken as 
a need for a new theory, a problem with perturbation theory, or ignored by taking a "shut-up-and-calculate" 
approach. 
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Nonabelian group 

In mathematics, a non-abelian group, also sometimes called a non-commutative group, is a group (G, * ) in which 
there are at least two elements a and b of G such that a * b * b * a.^ ^ The term non-abelian is used to distinguish 
from the idea of an abelian group, where all of the elements of the group commute. 

Non-abelian groups are pervasive in mathematics and physics. One of the simplest examples of a non-abelian group 
is the dihedral group of order 6. A common example from physics is the rotation group in three dimensions. 

Both discrete groups and continuous groups may be non-abelian; most of the interesting Lie groups are non-abelian. 
The term non-abelian is used primarily by physicists, rather than mathematicians, and is frequently taken to be a 
synonym for the collection of Lie groups. This usage is particularly common in gauge theory. 

Concepts in group theory 

category of groups 
subgroups, normal subgroups 
group homomorphisms, kernel, image, quotient 
direct product, direct sum 
semidirect product, wreath product 
Types of groups 
simple, finite, infinite 
discrete, continuous 
multiplicative, additive 
cyclic, abelian, dihedral 

nilpotent, solvable 
list of group theory topics 
glossary of group theory 
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Higher Dimensional Algebra 



Algebraic Topology 

Algebraic topology is a branch of mathematics which uses tools from abstract algebra to study topological spaces. 
The basic goal is to find algebraic invariants that classify topological spaces up to homeomorphism, though usually 
most classify up to homotopy equivalence. 

Although algebraic topology primarily uses algebra to study topological problems, using topology to solve algebraic 
problems is sometimes also possible. Algebraic topology, for example, allows for a convenient proof that any 
subgroup of a free group is again a free group. 

The method of algebraic invariants 

An older name for the subject was combinatorial topology, implying an emphasis on how a space X was constructed 
from simpler ones (the modern standard tool for such construction is the CW-complex). The basic method now 
applied in algebraic topology is to investigate spaces via algebraic invariants by mapping them, for example, to 
groups which have a great deal of manageable structure in a way that respects the relation of homeomorphism (or 
more general homotopy) of spaces. This allows one to recast statements about topological spaces into statements 
about groups, which are often easier to prove. 

Two major ways in which this can be done are through fundamental groups, or more generally homotopy theory, and 
through homology and cohomology groups. The fundamental groups give us basic information about the structure of 
a topological space, but they are often nonabelian and can be difficult to work with. The fundamental group of a 
(finite) simplicial complex does have a finite presentation. 

Homology and cohomology groups, on the other hand, are abelian and in many important cases finitely generated. 
Finitely generated abelian groups are completely classified and are particularly easy to work with. 

Setting in category theory 

In general, all constructions of algebraic topology are functorial; the notions of category, functor and natural 
transformation originated here. Fundamental groups and homology and cohomology groups are not only invariants 
of the underlying topological space, in the sense that two topological spaces which are homeomorphic have the same 
associated groups, but their associated morphisms also correspond — a continuous mapping of spaces induces a 
group homomorphism on the associated groups, and these homomorphisms can be used to show non-existence (or, 
much more deeply, existence) of mappings. 

Results on homology 

Several useful results follow immediately from working with finitely generated abelian groups. The free rank of the 
n-th homology group of a simplicial complex is equal to the n-th Betti number, so one can use the homology groups 
of a simplicial complex to calculate its Euler-Poincare characteristic. As another example, the top-dimensional 
integral homology group of a closed manifold detects orientability: this group is isomorphic to either the integers or 
0, according as the manifold is orientable or not. Thus, a great deal of topological information is encoded in the 
homology of a given topological space. 

Beyond simplicial homology, which is defined only for simplicial complexes, one can use the differential structure 
of smooth manifolds via de Rham cohomology, or Cech or sheaf cohomology to investigate the solvability of 
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differential equations defined on the manifold in question. De Rham showed that all of these approaches were 
interrelated and that, for a closed, oriented manifold, the Betti numbers derived through simplicial homology were 
the same Betti numbers as those derived through de Rham cohomology. This was extended in the 1950s, when 
Eilenberg and Steenrod generalized this approach. They defined homology and cohomology as functors equipped 
with natural transformations subject to certain axioms (e.g., a weak equivalence of spaces passes to an isomorphism 
of homology groups), verified that all existing (co)homology theories satisfied these axioms, and then proved that 
such an axiomatization uniquely characterized the theory. 

A new approach uses a functor from filtered spaces to crossed complexes defined directly and homotopically using 
relative homotopy groups; a higher homotopy van Kampen theorem proved for this functor enables basic results in 
algebraic topology, especially on the border between homology and homotopy, to be obtained without using singular 
homology or simplicial approximation. This approach is also called nonabelian algebraic topology, and generalises 
to higher dimensions ideas coming from the fundamental group. 

Applications of algebraic topology 

Classic applications of algebraic topology include: 

• The Brouwer fixed point theorem: every continuous map from the unit n-disk to itself has a fixed point. 

• The /i-sphere admits a nowhere- vanishing continuous unit vector field if and only if n is odd. (For n = 2, this is 
sometimes called the "hairy ball theorem".) 

• The Borsuk-Ulam theorem: any continuous map from the /i-sphere to Euclidean n-space identifies at least one 
pair of antipodal points. 

• Any subgroup of a free group is free. This result is quite interesting, because the statement is purely algebraic yet 
the simplest proof is topological. Namely, any free group G may be realized as the fundamental group of a graph 
X. The main theorem on covering spaces tells us that every subgroup H of G is the fundamental group of some 
covering space Y of X\ but every such Y is again a graph. Therefore its fundamental group His free. 

On the other hand this type of application is also handled by the use of covering morphisms of groupoids, and that 
technique has yielded subgroup theorems not yet proved by methods of algebraic topology (see the book by Higgins 
listed under groupoids). 

• Topological combinatorics 

Notable algebraic topologists 

• Frank Adams 

• Karol Borsuk 

• Luitzen Egbertus Jan Brouwer 

• William Browder 

• Ronald Brown (mathematician) 

• Nicolas Bourbaki 

• Henri Cartan 

• Hermann Kunneth 

• Samuel Eilenberg 

• Hans Freudenthal 

• Peter Freyd 

• Alexander Grothendieck 

• Friedrich Hirzebruch 

• Heinz Hopf 

• Michael J. Hopkins 
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• Witold Hurewicz 

• Egbert van Kampen 

• Saunders Mac Lane 

• Jean Leray 

• Mark Mahowald 

• J. Peter May 

• John Coleman Moore 

• Sergei Novikov 

• Lev Pontryagin 

• Mikhail Postnikov 

• Daniel Quillen 

• Jean-Pierre Serre 

• Stephen Smale 

• Norman Steenrod 

• Dennis Sullivan 

• Rene Thorn 

• Leopold Vietoris 

• Hassler Whitney 

• J. H. C. Whitehead 

• Brandon Blaha 

Important theorems in algebraic topology 

• Borsuk-Ulam theorem 

• Brouwer fixed point theorem 

• Cellular approximation theorem 

• Eilenberg— Zilber theorem 

• Freudenthal suspension theorem 

• Hurewicz theorem 

• Kunneth theorem 

• Poincare duality theorem 

• Universal coefficient theorem 

• Van Kampen' s theorem 

• Generalized van Kampen' s theorems ^ 

• Higher homotopy, generalized van Kampen's theorem 1 J 

• Whitehead's theorem 

Notes 

[i] http://pianetphysics.org/encyciopedia/GeneraiizedvanKampenTheoremsHDGVKT.htmi#BHKP 

[2] R. Brown, K. A. Hardie, K. H. Kamps and T. Porter, A homotopy double groupoid of a Hausdorff space, Theory and Applications of 
Categories. 10 (2002) 71-93. http://www.emis.de/journals/TAC/volumes/14/9/14-09.pdf 
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Category Theory 



Category theory is an area of study in mathematics that examines 
in an abstract way the properties of particular mathematical 
concepts, by formalising them as collections of objects and arrows 
(also called morphisms, although this term also has a specific, non 
category-theoretical sense), where these collections satisfy certain 
basic conditions. Many significant areas of mathematics can be 
formalised as categories, and the use of category theory allows 
many intricate and subtle mathematical results in these fields to be 
stated, and proved, in a much simpler way than without the use of 
categories. 



X 



f 



Y 




9 



A category with objects X, Y, Z and morphisms/, g 



The most accessible example of a category is the Category of sets, 
where the objects are sets and the arrows are functions from one 
set to another. However it is important to note that the objects of a 
category need not be sets nor the arrows functions; any way of 

formalising a mathematical concept such that it meets the basic conditions on the behaviour of objects and arrows is 
a valid category, and all the results of category theory will apply to it. 

One of the simplest examples of a category (which is a very important concept in topology) is that of groupoid, 
defined as a category whose arrows or morphisms are all invertible. Categories now appear in most branches of 
mathematics, some areas of theoretical computer science where they correspond to types, and mathematical physics 
where they can be used to describe vector spaces. Categories were first introduced by Samuel Eilenberg and 
Saunders Mac Lane in 1942-45, in connection with algebraic topology. 

Category theory has several faces known not just to specialists, but to other mathematicians. A term dating from the 
1940s, "general abstract nonsense", refers to its high level of abstraction, compared to more classical branches of 
mathematics. Homological algebra is category theory in its aspect of organising and suggesting manipulations in 
abstract algebra. Diagram chasing is a visual method of arguing with abstract "arrows" joined in diagrams. Note that 
arrows between categories are called functors, subject to specific defining commutativity conditions; moreover, 
categorical diagrams and sequences can be defined as functors (viz. Mitchell, 1965). An arrow between two functors 
is a natural transformation when it is subject to certain naturality or commutativity conditions. Functors and natural 
transformations ('naturality') are the key concepts in category theory^ . Topos theory is a form of abstract sheaf 
theory, with geometric origins, and leads to ideas such as pointless topology. A topos can also be considered as a 
specific type of category with two additional topos axioms. 



Background 

The study of categories is an attempt to axiomatically capture what is commonly found in various classes of related 
mathematical structures by relating them to the structure-preserving functions between them. A systematic study of 
category theory then allows us to prove general results about any of these types of mathematical structures from the 
axioms of a category. 

Consider the following example. The class Grp of groups consists of all objects having a "group structure". One can 
proceed to prove theorems about groups by making logical deductions from the set of axioms. For example, it is 
immediately proved from the axioms that the identity element of a group is unique. 

Instead of focusing merely on the individual objects (e.g., groups) possessing a given structure, category theory 
emphasizes the morphisms - the structure-preserving mappings - between these objects; by studying these 
morphisms, we are able to learn more about the structure of the objects. In the case of groups, the morphisms are the 
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group homomorphisms. A group homomorphism between two groups "preserves the group structure" in a precise 
sense - it is a "process" taking one group to another, in a way that carries along information about the structure of 
the first group into the second group. The study of group homomorphisms then provides a tool for studying general 
properties of groups and consequences of the group axioms. 

A similar type of investigation occurs in many mathematical theories, such as the study of continuous maps 
(morphisms) between topological spaces in topology (the associated category is called Top), and the study of smooth 
functions (morphisms) in manifold theory. 

If one axiomatizes relations instead of functions, one obtains the theory of allegories. 
Functors 

Abstracting again, a category is itself a type of mathematical structure, so we can look for "processes" which 
preserve this structure in some sense; such a process is called a functor. A functor associates to every object of one 
category an object of another category, and to every morphism in the first category a morphism in the second. 

In fact, what we have done is define a category of categories and functors - the objects are categories, and the 
morphisms (between categories) are functors. 

By studying categories and functors, we are not just studying a class of mathematical structures and the morphisms 
between them; we are studying the relationships between various classes of mathematical structures. This is a 
fundamental idea, which first surfaced in algebraic topology. Difficult topological questions can be translated into 
algebraic questions which are often easier to solve. Basic constructions, such as the fundamental group or 
fundamental groupoid of a topological space, can be expressed as fundamental functors to the category of 
groupoids in this way, and the concept is pervasive in algebra and its applications. 

Natural transformation 

Abstracting yet again, constructions are often "naturally related" - a vague notion, at first sight. This leads to the 
clarifying concept of natural transformation, a way to "map" one functor to another. Many important constructions in 
mathematics can be studied in this context. "Naturality" is a principle, like general covariance in physics, that cuts 
deeper than is initially apparent. 

Historical notes 

In 1942-45, Samuel Eilenberg and Saunders Mac Lane introduced categories, functors, and natural transformations 
as part of their work in topology, especially algebraic topology. Their work was an important part of the transition 
from intuitive and geometric homology to axiomatic homology theory. Eilenberg and Mac Lane later wrote that their 
goal was to understand natural transformations; in order to do that, functors had to be defined, which required 
categories. 

Stanislaw Ulam, and some writing on his behalf, have claimed that related ideas were current in the late 1930s in 
Poland. Eilenberg was Polish, and studied mathematics in Poland in the 1930s. Category theory is also, in some 
sense, a continuation of the work of Emmy Noether (one of Mac Lane's teachers) in formalizing abstract processes; 
Noether realized that in order to understand a type of mathematical structure, one needs to understand the processes 
preserving that structure. In order to achieve this understanding, Eilenberg and Mac Lane proposed an axiomatic 
formalization of the relation between structures and the processes preserving them. 

The subsequent development of category theory was powered first by the computational needs of homological 
algebra, and later by the axiomatic needs of algebraic geometry, the field most resistant to being grounded in either 
axiomatic set theory or the Russell- Whitehead view of united foundations. General category theory, an extension of 
universal algebra having many new features allowing for semantic flexibility and higher-order logic, came later; it is 
now applied throughout mathematics. 
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Certain categories called topoi (singular topos) can even serve as an alternative to axiomatic set theory as a 
foundation of mathematics. These foundational applications of category theory have been worked out in fair detail as 
a basis for, and justification of, constructive mathematics. More recent efforts to introduce undergraduates to 
categories as a foundation for mathematics include Lawvere and Rosebrugh (2003) and Lawvere and Schanuel 
(1997). 

Categorical logic is now a well-defined field based on type theory for intuitionistic logics, with applications in 
functional programming and domain theory, where a cartesian closed category is taken as a non-syntactic description 
of a lambda calculus. At the very least, category theoretic language clarifies what exactly these related areas have in 
common (in some abstract sense). 

Categories, objects, and morphisms 

A category C consists of the following three mathematical entities: 

• A class ob(C), whose elements are called objects; 

• A class hom(C), whose elements are called morphisms or maps or arrows. Each morphism/has a unique source 
object a and target object b. We write/: a —> b, and we say "/is a morphism from a to b" . We write hom(a, b) (or 
Hom(a, b), or hom c (a, b), or Mor(<2, b), or C(a, b)) to denote the hom-class of all morphisms from a to b. 

• A binary operation o , called composition of morphisms, such that for any three objects a, b, and c, we have 
hom(a, b) x hom(Z?, c) —> hom(a, c). The composition of/: a — » b and g: b — » c is written as g o for gf^ , 
governed by two axioms: 

• Associativity: If/: a^>b,g\b^>c and h: c — » d then h o [g o /) = (h o g) o / , and 

• Identity: For every object x, there exists a morphism 1^ : x —> x called the identity morphism for x, such that for 
every morphism/: a—^b, we have l b of = f = fol a . 

From these axioms, it can be proved that there is exactly one identity morphism for every object. Some authors 
deviate from the definition just given by identifying each object with its identity morphism. 

Relations among morphisms (such as/g = h) are often depicted using commutative diagrams, with "points" (corners) 
representing objects and "arrows" representing morphisms. 

Properties of morphisms 

Some morphisms have important properties. A morphism/: a — » b is: 

• a monomorphism (or monic) if fog ^ =fog 2 implies g 1 = g 2 for all morphisms g^ g 2 '.x^> a. 

• an epimorphism (or epic) if g o/= g^f implies g 1 = g 2 for all morphisms g , g : b —> x. 

• an isomorphism if there exists a morphism g : b —> a with/og = 1 and gof= 1 

• an endomorphism if a = b. end(a) denotes the class of endomorphisms of a. 

• an automorphism iff is both an endomorphism and an isomorphism, aut(a) denotes the class of automorphisms of 
a. 
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Functors 

Functors are structure-preserving maps between categories. They can be thought of as morphisms in the category of 
all (small) categories. 

A (covariant) functor F from a category C to a category D, written F:C —> D, consists of: 

• for each object x in C, an object F(x) in D; and 

• for each morphism/ : x — » y in C, a morphism F(f) : F(x) —> F(y), 

such that the following two properties hold: 

• For every object x in C, F(\^ = 1 ; 

• For all morphisms/: jc — » v and g : v — » z, o /) = ^(5) o F(f). 

A contravariant functor F: C —> D, is like a covariant functor, except that it "turns morphisms around" ("reverses all 
the arrows"). More specifically, every morphism/: x —> y in C must be assigned to a morphism F(f) : F(y) —> F(x) in 
D. In other words, a contravariant functor is a covariant functor from the opposite category C op to D. 



Natural transformations and isomorphisms 



A natural transformation is a relation between two functors. Functors often describe "natural constructions" and 
natural transformations then describe "natural homomorphisms" between two such constructions. Sometimes two 
quite different constructions yield "the same" result; this is expressed by a natural isomorphism between the two 
functors. 

If F and G are (covariant) functors between the categories C and £>, then a natural transformation n from F to G 
associates to every object xinCa morphism : F(x) — » G(x) in D such that for every morphism / : x —> y in C, we 

have T| o F(/) = G(/) o r| ; this means that the following diagram is commutative: 

y x 



F(X) 



Vx 

G(X) 



F(J) 



G(f) 



F(Y) 



The two functors F and G are called naturally isomorphic if there exists a natural transformation from F to G such 
that r] is an isomorphism for every object x in C. 



Universal constructions, limits, and colimits 

Using the language of category theory, many areas of mathematical study can be cast into appropriate categories, 
such as the categories of all sets, groups, topologies, and so on. These categories surely have some objects that are 
"special" in a certain way, such as the empty set or the product of two topologies, yet in the definition of a category, 
objects are considered to be atomic, i.e., we do not know whether an object A is a set, a topology, or any other 
abstract concept - hence, the challenge is to define special objects without referring to the internal structure of those 
objects. But how can we define the empty set without referring to elements, or the product topology without 
referring to open sets? 

The solution is to characterize these objects in terms of their relations to other objects, as given by the morphisms of 
the respective categories. Thus, the task is to find universal properties that uniquely determine the objects of interest. 
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Indeed, it turns out that numerous important constructions can be described in a purely categorical way. The central 
concept which is needed for this purpose is called categorical limit, and can be dualized to yield the notion of a 
colimit. 

Equivalent categories 

It is a natural question to ask: under which conditions can two categories be considered to be "essentially the same", 
in the sense that theorems about one category can readily be transformed into theorems about the other category? 
The major tool one employs to describe such a situation is called equivalence of categories, which is given by 
appropriate functors between two categories. Categorical equivalence has found numerous applications in 
mathematics. 

Further concepts and results 

The definitions of categories and functors provide only the very basics of categorical algebra; additional important 
topics are listed below. Although there are strong interrelations between all of these topics, the given order can be 
considered as a guideline for further reading. 

• The functor category D has as objects the functors from C to D and as morphisms the natural transformations of 
such functors. The Yoneda lemma is one of the most famous basic results of category theory; it describes 
representable functors in functor categories. 

• Duality: Every statement, theorem, or definition in category theory has a dual which is essentially obtained by 
"reversing all the arrows". If one statement is true in a category C then its dual will be true in the dual category 
C op . This duality, which is transparent at the level of category theory, is often obscured in applications and can 
lead to surprising relationships. 

• Adjoint functors: A functor can be left (or right) adjoint to another functor that maps in the opposite direction. 
Such a pair of adjoint functors typically arises from a construction defined by a universal property; this can be 
seen as a more abstract and powerful view on universal properties. 

Higher-dimensional categories 

Many of the above concepts, especially equivalence of categories, adjoint functor pairs, and functor categories, can 
be situated into the context of higher-dimensional categories. Briefly, if we consider a morphism between two 
objects as a "process taking us from one object to another", then higher-dimensional categories allow us to profitably 
generalize this by considering "higher-dimensional processes". 

For example, a (strict) 2-category is a category together with "morphisms between morphisms", i.e., processes which 
allow us to transform one morphism into another. We can then "compose" these "bimorphisms" both horizontally 
and vertically, and we require a 2-dimensional "exchange law" to hold, relating the two composition laws. In this 
context, the standard example is Cat, the 2-category of all (small) categories, and in this example, bimorphisms of 
morphisms are simply natural transformations of morphisms in the usual sense. Another basic example is to consider 
a 2-category with a single object; these are essentially monoidal categories. Bicategories are a weaker notion of 
2-dimensional categories in which the composition of morphisms is not strictly associative, but only associative "up 
to" an isomorphism. 

This process can be extended for all natural numbers n, and these are called /i-categories. There is even a notion of 
co -category corresponding to the ordinal number oo. 

Higher-dimensional categories are part of the broader mathematical field of higher-dimensional algebra, a concept 
introduced by Ronald Brown. For a conversational introduction to these ideas, see John Baez, 'A Tale of 
^-categories' (1996). [5] 
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See also 

• Domain theory 

• Enriched category theory 

• Glossary of category theory 

• Higher category theory 

• Higher-dimensional algebra 

• Important publications in category theory 

• Timeline of category theory and related mathematics 

Notes 

[1] Categories for the Working Mathematician, 2nd Edition, p 18: "As Eilenberg-Mac Lane first observed, 'category' has been defined in order to 

be able to define 'functor' and 'functor' has been defined in order to be able to define 'natural transformation' ". 
[2] http://planetphysics.org/encyclopedia/FundamentalGroupoidFunctor.html 

[3] Some authors compose in the opposite order, writing fg or f O Q for g O f . Computer scientists using category theory very commonly 
write fig for g O f 

[4] Note that a morphism that is both epic and monic is not necessarily an isomorphism! For example, in the category consisting of two objects A 

and B, the identity morphisms, and a single morphism/ from A to B, /is both epic and monic but is not an isomorphism. 
[5 ] http : / / math . ucr . edu/home/baez/ week7 3 . html 
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Double Groupoid 

In mathematics, especially in higher-dimensional algebra and homotopy theory, a double groupoid generalises the 
notion of groupoid and of category to a higher dimension. 

Definition 

A double groupoid D is a higher-dimensional groupoid involving a relationship for both v horizontal' and v vertical' 

groupoid structures^ . (A double groupoid can also be considered as a generalization of certain higher-dimensional 

Hi 

groups .) The geometry of squares and their compositions leads to a common representation of a double groupoid 
in the following diagram: 




where M is a set of "points', H and V are, respectively, "horizontal' and "vertical' groupoids, and S is a set of "squares' 
with two compositions. The composition laws for a double groupoid D make it also describable as a groupoid 
internal to the category of groupoids. 

Given two groupoids H, V over a set M, there is a double groupoid V) with H,V as horizontal and vertical 

edge groupoids, and squares given by quadruples 



h 

ti 



for which we assume always that h, h' are in H, v, v' are in V, and that the initial and final points of these edges 
match in M as suggested by the notation, that is for example sh = sv, th = sv',..., etc. The compositions are to be 
inherited from those of H,V, that is: 







( k' ) 








w w* 


H 
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This construction is the right adjoint to the forgetful functor which takes the double groupoid as above, to the pair of 
groupoids H,V over M. 

Other related constructions are that of a double groupoid with connection ^ and homotopy double groupoids ^ . 

Convolution algebra 

A convolution C* -algebra of a double groupoid can also be constructed by employing the square diagram D of a 
double groupoid ^ . 



Double Groupoid Category 

The category whose objects are double groupoids and whose morphisms are double groupoid homomorphisms that 
are double groupoid diagram (D) functors is called the double groupoid category, or the category of double 
groupoids. 



Notes 

[1] Brown, Ronald and C.B. Spencer: "Double groupoids and crossed modules.", Cahiers Top. Geom. Diff.. 17 (1976), 343-362 

[2] Brown, Ronald,, Higher-dimensional group theory (http://www.bangor.ac.Uk/r.brown/hdaweb2.htm) Explains how the groupoid concept 

has to led to higher-dimensional homotopy groupoids, having applications in homotopy theory and in group cohomology 
[3] http://planetphysics.org/encyclopedia/DoubleGroupoidWithConnection.html Double Groupoid with Connection 

[4] Brown, R., Hardie, K., Kamps, H. and T. Porter: 2002, "The homotopy double groupoid of a Hausdorff space.", Theory and Applications of 
Categories: 10, 71-93 

[5] http://planetphysics.org/encyclopedia/DoubleGroupoidGeometry.html Double Groupoid Geometry 

This article incorporates material from higher dimensional algebra (http:/ / planetmath. org/ encyclopedia/ 
ExtensionOfAlgebraicTopology. html), which is licensed under the Creative Commons Attribution/Share -Alike 
License. 
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Higher dimensional algebra 

This article is about higher-dimensional algebra and super categories in generalized category theory, 
super-category theory, and also its extensions in nonabelian algebraic topology and metamathematics^ . 

Supercategories were first introduced in 1970, and were subsequently developed for applications in theoretical 
physics (especially quantum field theory and topological quantum field theory) and mathematical biology or 
mathematical biophysics. 



Double groupoids, fundamental groupoids, 2-categories, categorical QFTs and 
TQFTs 

In higher-dimensional algebra (HDA), a double groupoid is a generalisation of a one-dimensional groupoid to two 
dimensions^ , and the latter groupoid can be considered as a special case of a category with all invertible arrows, or 
morphisms. 

Double groupoids are often used to capture information about geometrical objects such as higher-dimensional 
manifolds (or ^-dimensional manifolds) ^ . In general, an ^-dimensional manifold is a space that locally looks like 
an ^-dimensional Euclidean space, but whose global structure may be non-Euclidean. A first step towards defining 
higher dimensional algebras is the concept of 2-category of higher category theory, followed by the more "geometric' 
concept of double category ^ . Other pathways in HDA involve: bicategories, homomorphisms of bicategories, 
variable categories (aka, indexed, or parametrized categories), topoi, effective descent, enriched and internal 
categories, as well as quantum categories 1 ^ tl0] and quantum double groupoids^ 12 ^ . In the latter case, by 
considering fundamental groupoids defined via a 2-functor allows one to think about the physically interesting case 
of quantum fundamental groupoids (QFGs) in terms of the bicategory Span(Groupoids), and then constructing 
2-Hilbert spaces and 2-linear maps for manifolds and cobordisms. At the next step, one obtains cobordisms with 
corners via natural transformations of such 2-functors. A claim was then made that, with the gauge group SU(2), 

ri3i 

"the extended TQFT, or ETQFT, gives a theory equivalent to the Ponzano-Regge model of quantum gravity' ; 
similarly, the Turaev-Viro model would be then obtained with representations of SU_q(2). Therefore, according to 
the construction proposed by Jeffrey Morton, one can describe the state space of a gauge theory - or many kinds of 
quantum field theories (QFTs) and local quantum physics, in terms of the transformation groupoids given by 
symmetries, as for example in the case of a gauge theory, by the gauge transformations acting on states that are, in 
this case, connections. In the case of symmetries related to quantum groups, one would obtain structures that are 
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representation categories of quantum groupoids , instead of the 2- vector spaces that are representation categories 
of groupoids. 



Double categories, Category of categories and Supercategories 

A higher level concept is thus defined as a category of categories, or super-category, which generalises to higher 
dimensions the notion of category - regarded as any structure which is an interpretation of Lawvere's axioms of the 
elementary theory of abstract categories (ETAC)^ 15 ^ ^ ^ ^ . Thus, a supercategory and also a super-category, 
can be regarded as natural extensions of the concepts of meta-categoryj 19 ^ multicategory, and multi-graph, k-partite 
graph, or colored graph (see a color figure, and also its definition in graph theory). 

Double groupoids were first introduced by Ronald Brown in 1976, in refJ 20 -' and were further developed towards 

T211 T221 T231 T241 

applications in nonabelian algebraic topology 1 . A related, 'dual' concept is that of a double algebroid, 

and the more general concept of R-algebroid. 



Nonabelian algebraic topology 

Many of the higher dimensional algebraic structures are noncommutative and, therefore, their study is a very 

significant part of nonabelian category theory, and also of Nonabelian Algebraic Topology (NAAT) L which 

[27] 

generalises to higher dimensions ideas coming from the fundamental group . Such algebraic structures in 
dimensions greater than 1 develop the nonabelian character of the fundamental group, and they are in a precise sense 
more nonabelian than the groups' . These noncommutative, or more specifically, nonabelian structures 

reflect more accurately the geometrical complications of higher dimensions than the known homology and homotopy 
groups commonly encountered in classical algebraic topology. An important part of nonabelian algebraic topology is 
concerned with the properties and applications of homotopy groupoids and filtered spaces. Noncommutative double 
groupoids and double algebroids are only the first examples of such higher dimensional structures that are 
nonabelian. The new methods of Nonabelian Algebraic Topology (NAAT) "can be applied to determine homotopy 
invariants of spaces, and homotopy classification of maps, in cases which include some classical results, and allow 
results not available by classical methods" ^ . Cubical omega-groupoids, higher homotopy groupoids, crossed 
modules, crossed complexes and Galois groupoids are key concepts in developing applications related to homotopy 
of filtered spaces, higher dimensional space structures, the construction of the fundamental groupoid of a topos E in 
the general theory of topoi, and also in their physical applications in nonabelian quantum theories, and recent 
developments in quantum gravity, as well as categorical and topological dynamics . Further examples of such 
applications include the generalisations of noncommutative geometry formalizations of the noncommutative 
standard models via fundamental double groupoids and spacetime structures even more general than topoi or the 
lower-dimensional noncommutative spacetimes encountered in several topological quantum field theories and 
noncommutative geometry theories of quantum gravity. 

A fundamental result in NAAT is the generalised, higher homotopy van Kampen theorem proven by R. Brown 

which states that "the homotopy type of a topological space can be computed by a suitable colimit or homotopy 

colimit over homotopy types of its pieces". A related example is that of van Kampen theorems for categories of 

covering morphisms in lextensive categories . Other reports of generalisations of the van Kampen theorem 

T331 

include statements for 2-categories and a topos of topoi [34]. Important results in HDA are also the extensions of 
the Galois theory in categories and variable categories, or indexed/" parametrized' categories ^ ^ 36 ^ . The 
Joyal-Tierney representation theorem for topoi is also a generalisation of the Galois theory . Thus, indexing by 

T381 

bicategories in the sense of Benabou one also includes here the Joyal-Tierney theory 
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Higher category theory 

Higher category theory is the part of category theory at a higher order, which means that some equalities are 
replaced by explicit arrows in order to be able to explicitly study the structure behind those equalities. 

Strict higher categories 

N-categories are defined inductively using the enriched category theory: 0-categories are sets, and (n+l)-categories 
are categories enriched over the monoidal category of n-categories (with the monoidal structure given by finite 
products) This construction is well defined, as shown in the article on n-categories. This concept introduces 
higher arrows, higher compositions and higher identities, which must well behave together. For example, the 
category of small categories is in fact a 2-category, with natural transformations as second degree arrows. However 
this concept is too strict for some purposes (for example, homotopy theory), where "weak" structures arise in the 
form of higher categories. 

Weak higher categories 

In weak n-categories, the associativity and identity conditions are no longer strict (that is, they are not given by 
equalities), but rather are satisfied up to an isomorphism of the next level. An example in topology is the 
composition of paths, which is associative only up to homotopy. These isomorphisms must well behave between 
hom-sets and expressing this is the difficulty in the definition of weak n-categories. Weak 2-categories, also called 
bicategories, were the first to be defined explicitly. A particularity of these is that a bicategory with one object is 
exactly a monoidal category, so that bicategories can be said to be "monoidal categories with many objects." Weak 
3 -categories, also called tricategories, and higher-level generalizations are increasingly harder to define explicitly. 
Several definitions have been given, and telling when they are equivalent, and in what sense, has become a new 
object of study in category theory. 

Quasicategories 

Weak Kan complexes, or quasi-categories, are semisimplicial complexes satisfying a weak version of the Kan 
condition. Joyal showed that they are a good foundation for higher category theory. Recently the theory has been 
sistematized further by Jacob Lurie who simply call them infinity categories, though the latter term is also a generic 
term for all models of (infinity ,k) categories for any k. 

Simplicially enriched category 

Simplicially enriched categories, or simplicial categories, are categories enriched over simplicial sets. However, 
when we look at them as a model for (infinity, l)-categories, then many categorical notions, say limits do not agree 
with the corresponding notions in the sense of enriched categories. The same for other enriched models like 
topologically enriched categories. 
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Topologically enriched categories 

Topologically enriched categories (sometimes simply topological categories) are categories enriched over some 
convenient category of topological spaces, e.g. the category of compactly generated Hausdorff topological spaces. 

Segal categories 

These are models of higher categories introduced by Hirschowitz and Simpson in 1988 , partly inspired by results 
of Graeme Segal in 1974. 
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Duality (mathematics) 

In mathematics, duality has numerous meanings, and although it is "a very pervasive and important concept in 
(modern) mathematics"^ and "an important general theme that has manifestations in almost every area of 
mathematics", there is no single universally agreed definition that unifies all concepts of duality. 

Generally speaking, a duality translates concepts, theorems or mathematical structures into other concepts, theorems 
or structures, in a one-to-one fashion, often (but not always) by means of an involution operation: if the dual of A is 
B, then the dual of B is A. As involutions sometimes have fixed points, the dual of A is sometimes A itself. For 
example, Desargues' theorem in projective geometry is self-dual in this sense. 

Many mathematical dualities between objects of two types correspond to pairings, bilinear functions from an object 
of one type and another object of the second type to some family of scalars. For instance, linear algebra duality 
corresponds in this way to bilinear maps from pairs of vector spaces to scalars, the duality between distributions and 
the associated test functions corresponds to the pairing in which one integrates a distribution against a test function, 
and Poincare duality corresponds similarly to intersection number, viewed as a pairing between submanifolds of a 
given manifold. 



Order-reversing dualities 

A particularly simple form of duality comes from order theory. The dual of a poset P = (X, <) is the poset P d = (X, >) 
comprising the same ground set but the converse relation. Familiar examples of dual partial orders include 

• the subset and superset relations C and D on any collection of sets, 

• the divides and multiple -of relations on the integers, and 

• the descendant-of and ancestor-of relations on the set of humans. 

A concept defined for a partial order P will correspond to a dual concept on the dual poset P d . For instance, a 
minimal element of P will be a maximal element of P d \ minimality and maximality are dual concepts in order theory. 
Other pairs of dual concepts are upper and lower bounds, lower sets and upper sets, and ideals and filters. 

A particular order reversal of this type occurs in the family of all subsets of some set S: if A = S \ A denotes the 
complement set, then A C B if and only if Q £ A • I n topology, open sets and closed sets are dual concepts: the 
complement of an open set is closed, and vice versa. In matroid theory, the family of sets complementary to the 
independent sets of a given matroid themselves form another matroid, called the dual matroid. In logic, one may 
represent a truth assignment to the variables of an unquantified formula as a set, the variables that are true for the 
assignment. A truth assignment satisfies the formula if and only if the complementary truth assignment satisfies the 

De Morgan dual of its formula. The existential and universal quantifiers in logic are similarly dual. 

A partial order may be interpreted as a category in which there is an arrow from x to y in the category if and only if 

x < y in the partial order. The order-reversing duality of partial orders can be extended to the concept of a dual 

category, the category formed by reversing all the arrows in a given category. Many of the specific dualities 

described later are dualities of categories in this sense. 

According to Artstein-Avidan and Milman,^ ^ a duality transform is just an involutive antiautomorphism 7~of a 
partially ordered set S, that is, an order reversing involution T \ S — > S. Surprisingly, in several important cases 
these simple properties determine the transform uniquely up to some simple symmetries. If 7i, T^are two duality 
transforms then their composition is an order automorphism of S; thus, any two duality transforms differ only by an 
order automorphism. For example, all order automorphisms of a power set S=2 are induced by permutations of R. 
The papers cited above treat only sets S of functions on R n satisfying some condition of convexity and prove that all 
order automorphisms are induced by linear or affine transformations of R n . 
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Dimension-reversing dualities 



There are many distinct but interrelated dualities in which 
geometric or topological objects correspond to other objects of the 
same type, but with a reversal of the dimensions of the features of 
the objects. A classical example of this is the duality of the 
platonic solids, in which the cube and the octahedron form a dual 
pair, the dodecahedron and the icosahedron form a dual pair, and 
the tetrahedron is self-dual. The dual polyhedron of any of these 
polyhedra may be formed as the convex hull of the center points of 
each face of the primal polyhedron, so the vertices of the dual 
correspond one-for-one with the faces of the primal. Similarly, 
each edge of the dual corresponds to an edge of the primal, and 
each face of the dual corresponds to a vertex of the primal. These 
correspondences are incidence-preserving: if two parts of the 
primal polyhedron touch each other, so do the corresponding two 
parts of the dual polyhedron. More generally, using the concept of 
polar reciprocation, any convex polyhedron, or more generally any 

convex polytope, corresponds to a dual polyhedron or dual polytope, with an /-dimensional feature of an 
^-dimensional polytope corresponding to an (n - i - l)-dimensional feature of the dual polytope. The 
incidence-preserving nature of the duality is reflected in the fact that the face lattices of the primal and dual 
polyhedra or polytopes are themselves order- theoretic duals. Duality of poly topes and order-theoretic duality are 
both involutions: the dual polytope of the dual polytope of any polytope is the original polytope, and reversing all 
order-relations twice returns to the original order. Choosing a different center of polarity leads to geometrically 
different dual polytopes, but all have the same combinatorial structure. 




The features of the cube and its dual octahedron 
correspond one-for-one with dimensions reversed. 




From any three-dimensional polyhedron, one can form a 
planar graph, the graph of its vertices and edges. The dual 
polyhedron has a dual graph, a graph with one vertex for 
each face of the polyhedron and with one edge for every 
two adjacent faces. The same concept of planar graph 
duality may be generalized to graphs that are drawn in the 
plane but that do not come from a three-dimensional 
polyhedron, or more generally to graph embeddings on 
surfaces of higher genus: one may draw a dual graph by 
placing one vertex within each region bounded by a cycle 
of edges in the embedding, and drawing an edge 
connecting any two regions that share a boundary edge. 

An important example of this type comes from computational geometry: the duality for any finite set S of points in 
the plane between the Delaunay triangulation of S and the Voronoi diagram of S. As with dual polyhedra and dual 
polytopes, the duality of graphs on surfaces is a dimension-reversing involution: each vertex in the primal embedded 
graph corresponds to a region of the dual embedding, each edge in the primal is crossed by an edge in the dual, and 
each region of the primal corresponds to a vertex of the dual. The dual graph depends on how the primal graph is 
embedded: different planar embeddings of a single graph may lead to different dual graphs. Matroid duality is an 
algebraic extension of planar graph duality, in the sense that the dual matroid of the graphic matroid of a planar 
graph is isomorphic to the graphic matroid of the dual graph. 



A planar graph, G, and its dual graph, G'. 
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In topology, Poincare duality also reverses dimensions; it corresponds to the fact that, if a topological manifold is 
respresented as a cell complex, then the dual of the complex (a higher dimensional generalization of the planar graph 
dual) represents the same manifold. In Poincare duality, this homeomorphism is reflected in an isomorphism of the 
Mi homology group and the (n - &)th cohomology group. 

Another example of a dimension-reversing duality arises 
in projective geometry J 6 ^ In the projective plane, it is 
possible to find geometric transformations that map each 
point of the projective plane to a line, and each line of the 
projective plane to a point, in an incidence-preserving 
way: in terms of the incidence matrix of the points and 

lines in the plane, this operation is just that of forming the 

m r p ,1 • . . t The complete quadrangle, a configuration of four points and six 

transpose. Transformations of this type exist also in any 

lines in the projective plane (left) and its dual configuration, the 

higher dimension; one way to construct them is to use the complete quadrilateral? with four lines and six points (right)> 
same polar transformations that generate polyhedron and 

poly tope duality. Due to this ability to replace any configuration of points and lines with a corresponding 
configuration of lines and points, there arises a general principle of duality in projective geometry: given any 
theorem in plane projective geometry, exchanging the terms "point" and "line" everywhere results in a new, equally 
valid theorem. 

The points, lines, and higher dimensional subspaces ^-dimensional projective space may be interpreted as describing 
the linear subspaces of an (n + l)-dimensional vector space; if this vector space is supplied with an inner product the 
transformation from any linear subspace to its perpendicular subspace is an example of a projective duality. The 
Hodge dual extends this duality within an inner product space by providing a canonical correspondence between the 
elements of the exterior algebra. 

A kind of geometric duality also occurs in optimization theory, but not one that reverses dimensions. A linear 
program may be specified by a system of real variables (the coordinates for a point in Euclidean space R w ), a system 
of linear constraints (specifying that the point lie in a half space; the intersection of these half spaces is a convex 
polytope, the feasible region of the program), and a linear function (what to optimize). Every linear program has a 
dual problem with the same optimal solution, but the variables in the dual problem correspond to constraints in the 
primal problem and vice versa. 




Duality in logic and set theory 

In logic, functions or relations A and B are considered dual if A(-ot) = -*B(x), where -> is logical negation. The basic 
duality of this type is the duality of the 3 and V quantifiers. These are dual because 3x.-*P(x) and -Nx.P(x) are 
equivalent for all predicates P: if there exists an x for which P fails to hold, then it is false that P holds for all x. From 
this fundamental logical duality follow several others: 

• A formula is said to be satisfiable in a certain model if there are assignments to its free variables that render it 
true; it is valid if every assignment to its free variables makes it true. Satisfiability and validity are dual because 
the invalid formulas are precisely those whose negations are satisfiable, and the unsatisfiable formulas are those 
whose negations are valid. This can be viewed as a special case of the previous item, with the quantifiers ranging 
over interpretations. 

• In classical logic, the a and v operators are dual in this sense, because (px a -ry) and -i(jc v y) are equivalent. This 
means that for every theorem of classical logic there is an equivalent dual theorem. De Morgan's laws are 
examples. More generally, f\{~ ] Xi) = — ■ \f Xi . The left side is true if and only if V/.-a;., and the right side if 

and only if ->3z.x.. 
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• In modal logic, Op means that the proposition p is "necessarily" true, and ()p that p is "possibly" true. Most 
interpretations of modal logic assign dual meanings to these two operators. For example in Kripke semantics, "p 
is possibly true" means "there exists some world Win which p is true", while "p is necessarily true" means "for all 
worlds W, p is true". The duality of □ and Qthen follows from the analogous duality of V and 3. Other dual 
modal operators behave similarly. For example, temporal logic has operators denoting "will be true at some time 
in the future" and "will be true at all times in the future" which are similarly dual. 

Other analogous dualities follow from these: 

• Set-theoretic union and intersection are dual under the set complement operator . That is, 

(A C n B C ) = (A U B)° , and more generally, p| A% = ((J A^ > This follows from the duality of V 

and 3: an element x is a member of ^| A^ if and only if Va.^xGA^, and is a member of A^j* if an d only 

if -Ga.xG4 . 

a 

Topology inherits a duality between open and closed subsets of some fixed topological space X\ a subset U of X is 
closed if and only if its complement in X is open. Because of this, many theorems about closed sets are dual to 
theorems about open sets. For example, any union of open sets is open, so dually, any intersection of closed sets is 
closed. The interior of a set is the largest open set contained in it, and the closure of the set is the smallest closed set 
that contains it. Because of the duality, the complement of the interior of any set U is equal to the closure of the 
complement of U. 

The collection of all open subsets of a topological space X forms a complete Heyting algebra. There is a duality, 
known as Stone duality, connecting sober spaces and spatial locales. 

• Birkhoff s representation theorem relating distributive lattices and partial orders 



Dual objects 

A group of dualities can be described by endowing, for any mathematical object X, the set of morphisms Hom(X, D) 
into some fixed object D, with a structure similar to the one of X. This is sometimes called internal Horn. In general, 
this yields a true duality only for specific choices of D, in which case X*=Hom(X, D) is referred to as the dual of X. 
It may or may not be true that the bidual, that is to say, the dual of the dual, X** = (X*)* is isomorphic to X, as the 
following example, which is underlying many other dualities, shows: the dual vector space V* of a K- vector space V 
is defined as 

V* = Horn (V, K). 

The set of morphisms, i.e., linear maps, is a vector space in its own right. There is always a natural, injective map V 
—> V** given by v (f\->fly)), where /is an element of the dual space. That map is an isomorphism if and only if 
the dimension of V is finite. 

In the realm of topological vector spaces, a similar construction exists, replacing the dual by the topological dual 
vector space. A topological vector space that is canonically isomorphic to its bidual is called reflexive space. 

The dual lattice of a lattice L is given by 

Hom(L, Z), 

which is used in the construction of toric varieties The Pontryagin dual of locally compact topological groups G is 
given by 

Hom(G, S l ), 

continuous group homomorphisms with values in the circle (with multiplication of complex numbers as group 
operation). 
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Dual categories 

Opposite category and adjoint functors 

In another group of dualities, the objects of one theory are translated into objects of another theory and the maps 
between objects in the first theory are translated into morphisms in the second theory, but with direction reversed. 
Using the parlance of category theory, this amounts to a contravariant functor between two categories C and D: 

F.C^D 

which for any two objects X and Y of C gives a map 

Hom c (X, Y) -> Hornby), F(X)) 

That functor may or may not be an equivalence of categories. There are various situations, where such a functor is an 
equivalence between the opposite category C° p of C, and D. Using a duality of this type, every statement in the first 
theory can be translated into a "dual" statement in the second theory, where the direction of all arrows has to be 
reversed. 1 Therefore, any duality between categories C and D is formally the same as an equivalence between C 
and Z) op (C op and D). However, in many circumstances the opposite categories have no inherent meaning, which 
makes duality an additional, separate concept' 101 

Many category-theoretic notions come in pairs in the sense that they correspond to each other while considering the 
opposite category. For example, Cartesian products x Y^ and disjoint unions u Y^ of sets are dual to each other 
in the sense that 

Hom(X, Y x Y ) = Hom(X, Y ) x Hom(X, Y ) 

and 

Hom(y uY ,X) = Hom(y , X) x Hom(7 2 , X) 

for any set X. This is a particular case of a more general duality phenomenon, under which limits in a category C 
correspond to colimits in the opposite category C op ; further concrete examples of this are epimorphisms vs. 
monomorphism, in particular factor modules (or groups etc.) vs. submodules, direct products vs. direct sums (also 
called coproducts to emphasize the duality aspect). Therefore, in some cases, proofs of certain statements can be 
halved, using such a duality phenomenon. Further notions displaying related by such a categorical duality are 
projective and injective modules in homological algebra' 111 fibrations and cofibrations in topology and more 
generally model categories' 121 

Two functors F: C —> D and G\ D —> C are adjoint if for all objects c in C and d 'mD 

Hom D (F(c), d) = Hom c (c, G{d)\ 

in a natural way. Actually, the correspondence of limits and colimits is an example of adjoints, since there is an 
adjunction 

colim : C 1 <-> C : A 

between the colimit functor that assigns to any diagram in C indexed by some category / its colimit and the diagonal 
functor that maps any object c of C to the constant diagramm which has c at all places. Dually, 

A : C 1 ^ C : Urn- 
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Examples 

For example, there is a duality between commutative rings and affine schemes: to every commutative ring A there is 
an affine spectrum, Spec A, conversely, given an affine scheme S, one gets back a ring by taking global sections of 
the structure sheaf O^. In addition, ring homomorphisms are in one-to-one correspondence with morphisms of affine 
schemes, thereby there is an equivalence 

(Commutative rings) op = (affine schemes/ 13 ^ 

Compare with noncommutative geometry and Gelfand duality. 

In a number of situations, the objects of two categories linked by a duality are partially ordered, i.e., there is some 
notion of an object "being smaller" than another one. In such a situation, a duality that respects the orderings in 
question is known as a Galois connection. An example is the standard duality in Galois theory (fundamental theorem 
of Galois theory) between field extensions and subgroups of the Galois group: a bigger field extension 
corresponds — under the mapping that assigns to any extension LD K (inside some fixed bigger field £1) the Galois 
group Gal(£2 / L) — to a smaller group J 14 ^ 

Pontryagin duality gives a duality on the category of locally compact abelian groups: given any such group G, the 
character group 

X(G) = Hom(G, S l ) 

given by continuous group homomorphisms from G to the circle group S l can be endowed with the compact-open 
topology. Pontryagin duality states that the character group is again locally compact abelian and that 

G = X (X(G)). [15] 

Moreover, discrete groups correspond to compact abelian groups; finite groups correspond to finite groups. 
Pontryagin is the background to Fourier analysis, see below. 

• Tannaka-Krein duality, a non-commutative analogue of Pontryagin duality^ 16 ^ 

• Gelfand duality relating commutative C*-algebras and compact Hausdorff spaces 

ri7i 

Both Gelfand and Pontryagin duality can be deduced in a largely formal, category-theoretic way. 



Analytic dualities 

In analysis, problems are frequently solved by passing to the dual description of functions and operators. 
Fourier transform switches between functions on a vector space and its dual: 

/(£) :=/ f(x)e***dx, 

and conversely 

-oo 

If /is an L 2 -function on R or R^, say, then so is f and j^_ x ^ — f(x) - Moreover, the transform interchanges 

operations of multiplication and convolution on the corresponding function spaces. A conceptual explanation of the 
Fourier transform is obtained by the aforementioned Pontryagin duality, applied to the locally compact groups R (or 
R^ etc.): any character of R is given by ^ i — > e -27rza: f . The dualizing character of Fourier transform has many 

other manifestations, for example, in alternative descriptions of quantum mechanical systems in terms of coordinate 
and momentum representations. 

• Laplace transform is similar to Fourier transform and interchanges operators of multiplication by polynomials 
with constant coefficient linear differential operators. 

• Legendre transformation is an important analytic duality which switches between velocities in Lagrangian 
mechanics and momenta in Hamiltonian mechanics. 
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Poincare-style dualities 

Theorems showing that certain objects of interest are the dual spaces (in the sense of linear algebra) of other objects 
of interest are often called dualities. Many of these dualities are given by a bilinear pairing of two K- vector spaces 

A <g> B -> K. 

For perfect pairings, there is, therefore, an isomorphism of A to the dual of B. 

For example, Poincare duality of a smooth compact complex manifold X is given by a pairing of singular 
cohomology with C-coefficients (equivalently, sheaf cohomology of the constant sheaf C) 



where n is the (complex) dimension of X. Poincare duality can also be expressed as a relation of singular 
homology and de Rham cohomology, by asserting that the map 



j 7 

(integrating a differential &-form over an 2ft-&-(real)-dimensional cycle) is a perfect pairing. 

The same duality pattern holds for a smooth projective variety over a separably closed field, using 1-adic 
cohomology with Q^-coefficients instead P 9 ^ This is further generalized to possibly singular varieties, using 
intersection cohomology instead, a duality called Verdier duality P 0 ^ With increasing level of generality, it turns out, 
an increasing amount of technical background is helpful or necessary to understand these theorems: the modern 
formulation of both these dualities can be done using derived categories and certain direct and inverse image 
functors of sheaves, applied to locally constant sheaves (with respect to the classical analytical topology in the first 
case, and with respect to the etale topology in the second case). 

Yet another group of similar duality statements is encountered in arithmetics: etale cohomology of finite, local and 
global fields (also known as Galois cohomology, since etale cohomology over a field is equivalent to group 
cohomology of the (absolute) Galois group of the field) admit similar pairings. The absolute Galois group G(FJ of a 
finite field, for example, is isomorphic to ^ , the profinite completion of Z, the integers. Therefore, the perfect 
pairing (for any G-module M) 

H n (G, M) x H 1 "" (G, Horn (M, Q/Z)) -> Q/Z [21] 

is a direct consequence of Pontryagin duality of finite groups. For local and global fields, similar statements exist 

T221 

(local duality and global or Poitou-Tate duality). 1 

Serre duality or coherent duality are similar to the statements above, but applies to cohomology of coherent sheaves 
instead P 3 ^ 

• Alexander duality 
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Anabelian geometry 

Anabelian geometry is a proposed theory in mathematics, describing the way the algebraic fundamental group G of 
an algebraic variety V, or some related geometric object, determines how V can be mapped into another geometric 
object W, under the assumption that G is very far from being an abelian group, in a sense to be made more precise. 
The word anabelian (an alpha privative an- before abelian) was introduced in Esquisse d'un Programme, an 
influential manuscript of Alexander Grothendieck, circulated in the 1980s J ^ 

While the work of Grothendieck was for many years unpublished, and unavailable through the traditional formal 
scholarly channels, the formulation and predictions of the proposed theory received much attention, and some 
alterations, at the hands of a number of mathematicians. Those who have researched in this area have obtained some 
expected and related results, and in the 21st century the beginnings of such a theory started to be available. 

Formulation of a conjecture of Grothendieck on curves 

The "anabelian question" has been formulated as 

how much information about the isomorphism class of the variety X is contained in the knowledge of the etale fundamental group?^ J 

A concrete example is the case of curves, which may be affine as well as projective. Suppose given a hyperbolic 
curve C, i.e. the complement of n points in a projective algebraic curve of genus g, taken to be smooth and 
irreducible, defined over a field K that is finitely generated (over its prime field), such that 

2 - 2g - n < 0. 

Grothendieck conjectured that the algebraic fundamental group G of C, a profinite group, determines C itself (i.e. the 
isomorphism class of G determines that of C). This was proved by Shinichi Mochizuki. An example is for the case 
of g = 0 (the projective line) and n = 4, when the isomorphism class of C is determined by the cross-ratio in K of the 
four points removed (almost, there being an order to the four points in a cross-ratio, but not in the points removed) 
There are also results for the case of K a local field 
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External links 

• Heidelberg Lectures on Fundamental Groups (http://www.renyi.hu/~szamuely/heid.pdf), section 5. 

Noncommutative geometry 

Noncommutative geometry (NCG), is a branch of mathematics concerned with geometric approach to 
noncommutative algebras, and with construction of spaces which are locally presented by noncommutative algebras 
of functions (possibly in some generalized sense). A noncommutative algebra is here an associative algebra in which 
the multiplication is not commutative, that is, for which xy does not always equal yx\ or more generally an algebraic 
structure in which one of the principal binary operations is not commutative; one also allows additional structures, 
e.g. topology or norm to be possibly carried by the noncommutative algebra of functions. The leading direction in 
noncommutative geometry has been laid by French mathematician Alain Connes since his involvement from about 
1979. 

Motivation 

Main motivation is to extend the commutative duality between spaces and functions to the noncommutative setting. 
In mathematics, there is a close relationship between spaces, which are geometric in nature, and the numerical 
functions on them. In general, such functions will form a commutative ring. For instance, one may take the ring C(X) 
of continuous complex- valued functions on a topological space X. In many important cases (e.g., if X is a compact 
Hausdorff space), we can recover X from C(X), and therefore it makes some sense to say that X has commutative 
geometry. 

More specifically, in topology, compact Hausdorff topological spaces can be reconstructed from the Banach algebra 
of functions on the space (Gel'fand-Neimark). In commutative algebraic geometry, algebraic schemes are locally 
prime spectra of commutative unital rings (A. Grothendieck), and schemes can be reconstructed from the categories 
of quasicoherent sheaves of modules on them (P. Gabriel-A. Rosenberg). For Grothendieck topologies, the 
cohomological properties of a site are invariant of the corresponding category of sheaves of sets viewed abstractly as 
a topos (A. Grothendieck). In all these cases, a space is reconstructed from the algebra of functions or its categorified 
version — some category of sheaves on that space. 

Functions on a topological space can be multiplied and added pointwise hence they form a commutative algebra; in 
fact these operations are local in the topology of the base space, hence the functions form a sheaf of commutative 
rings over the base space. 

The dream of noncommutative geometry is to generalize this duality to the duality between 

• noncommutative algebras, or sheaves of noncommutative algebras, or sheaf-like noncommutative algebraic or 
operator-algebraic structures 

• and geometric entities of certain kind, 

and interact between the algebraic and geometric description of those via this duality. 
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Regarding that the commutative rings correspond to usual affine schemes, and commutative C*-algebras to usual 
topological spaces, the extension to noncommutative rings and algebras requires non-trivial generalization of 
topological spaces, as "non-commutative spaces". For this reason, some talk about non-commutative topology, 
though the term has also other meanings. 

Applications in mathematical physics 

Some applications in particle physics are described on the entries Noncommutative standard model and 
Noncommutative quantum field theory. Sudden rise in interest in noncommutative geometry in physics, follows after 
the speculations of its role in M-theory made in 1997^ . 

Motivation from ergodic theory 

Some of the theory developed by Alain Connes to handle noncommutative geometry at a technical level has roots in 
older attempts, in particular in ergodic theory. The proposal of George Mackey to create a virtual subgroup theory, 
with respect to which ergodic group actions would become homogeneous spaces of an extended kind, has by now 
been subsumed. 

Non-commutative C* -algebras, von Neumann algebras 

(The formal duals of) non-commutative C*-algebras are often now called non-commutative spaces. This is by 
analogy with the Gelfand representation, which shows that commutative C*-algebras are dual to locally compact 
Hausdorff spaces. In general, one can associate to any C*-algebra S a topological space S; see spectrum of a 
C*-algebra. 

For the duality between a-finite measure spaces and commutative von Neumann algebras, noncommutative von 
Neumann algebras are called non-commutative measure spaces. 

Non-commutative differentiable manifolds 

A smooth Riemannian manifold M is a topological space with a lot of extra structure. From its algebra of continuous 
functions C(M) we only recover M topologically. The algebraic invariant that recovers the Riemannian structure is a 
spectral triple. It is constructed from a smooth vector bundle E over M, e.g. the exterior algebra bundle. The Hilbert 
space E^M.E) of square integrable sections of E carries a representation of C(M) by multiplication operators, and we 
consider an unbounded operator D in E^M^E) with compact resolvent (e.g. the signature operator), such that the 
commutators [D,f] are bounded whenever / is smooth. A recent deep theorem states that Mas a Riemannian 
manifold can be recovered from this data. 

This suggests that one might define a noncommutative Riemannian manifold as a spectral triple (A,H,D), consisting 
of a representation of a C*-algebra A on a Hilbert space H, together with an unbounded operator D on H, with 
compact resolvent, such that [D,a] is bounded for all a in some dense subalgebra of A. Research in spectral triples is 
very active, and many examples of noncommutative manifolds have been constructed. 
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Non-commutative affine and projective schemes 

In analogy to the duality between affine schemes and commutative rings, we define a category of noncommutative 
affine schemes as the dual of the category of associative unital rings. There are certain analogues of Zariski topology 
in that context so that one can glue such affine schemes to more general objects. 

There are also generalizations of the Cone and of the Proj of a commutative graded ring, mimicking a Serre's 
theorem on Proj. Namely the category of quasicoherent sheaves of O-modules on a Proj of a commutative graded 
algebra is equivalent to the category of graded modules over the ring localized on Serre's subcategory of graded 
modules of finite length; there is also analogous theorem for coherent sheaves when the algebra is Noetherian. This 
theorem is extended as a definition of noncommutative projective geometry by Michael Artin and J. J. Zhang 1 , 
who add also some general ring-theoretic conditions (e.g. Artin- Schelter regularity). 

Many properties of projective schemes extend to this context. For example, there exist an analog of the celebrated 
Serre duality for noncommutative projective schemes of Artin and Zhang 1 . 

A. L. Rosenberg has created a rather general relative concept of noncommutative quasicompact scheme (over a 
base category), abstracting the Grothendieck's study of morphisms of schemes and covers in terms of categories of 
quasicoherent sheaves and flat localization functors ^ . There is also another interesting approach via localization 
theory, due to Fred Van Oystaeyen, Luc Willaert and Alain Verschoeren, where the main concept is that of a 
schematic algebra 1 ^ . 



Invariants for noncommutative spaces 

Some of the motivating questions of the theory are concerned with extending known topological invariants to formal 
duals of noncommutative (operator) algebras and other replacements and candidates for noncommutative spaces. 
One of the main starting points of the Alain Connes' direction in noncommutative geometry is his spectacular 
discovery (and independently by Boris Tsygan) of a very important new homology theory associated to 
noncommutative associative algebras and noncommutative operator algebras, namely the cyclic homology and its 
relations to the algebraic K-theory (primarily via Connes-Chern character map). 

The theory of characteristic classes of smooth manifolds has been extended to spectral triples, employing the tools of 
operator K-theory and cyclic cohomology. Several generalizations of now classical index theorems allow for 
effective extraction of numerical invariants from spectral triples. The fundamental characteristic class in cyclic 
cohomology, the JLO cocycle, generalizes the classical Chern character. 



Examples of non-commutative spaces 

• In Weyl quantization, the symplectic phase space of classical mechanics is deformed into a non-commutative 
phase space generated by the position and momentum operators. 

• The standard model of particle physics is another example of a noncommutative geometry, cf noncommutative 
standard model. 

• The noncommutative torus, deformation of the function algebra of the ordinary torus, can be given the structure 
of a spectral triple. This class of examples has been studied intensively and still functions as a test case for more 
complicated situations. 

• Snyder space ^ 

• Noncommutative algebras arising from foliations. 

• Examples related to dynamical systems arising from number theory, such as the Gauss shift on continued 
fractions, give rise to noncommutative algebras that appear to have interesting noncommutative geometries. 
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Quantum Logics and Quantum Computers 



Multi-valued logic 

Multi-valued logics are 'logical calculi' in which there are more than two truth values. Traditionally, in Aristotle's 
logical calculus, there were only two possible values (i.e., "true" and "false") for any proposition. An obvious 
extension to classical two- valued logic is an ^-valued logic for n > 2. Those most popular in the literature are 
three-valued (e.g., Lukasiewicz's and Kleene's), — which accept the values "true", "false", and "unknown", — the 
finite-valued with more than 3 values, and the infinite-valued (e.g. fuzzy logic) logics. 

Relation to classical logic 

Logics are usually systems intended to codify rules for preserving some semantic property of propositions across 
transformations. In classical logic, this property is "truth." In a valid argument, the truth of the derived proposition is 
guaranteed if the premises are jointly true, because the application of valid steps preserves the property. However, 
that property doesn't have to be that of "truth"; instead, it can be some other concept. 

Multi- valued logics are intended to preserve the property of designationhood (or being designated). Since there are 
more than two truth values, rules of inference may be intended to preserve more than just whichever corresponds (in 
the relevant sense) to truth. For example, in a three- valued logic, sometimes the two greatest truth- values (when they 
are represented as e.g. positive integers) are designated and the rules of inference preserve these values. Precisely, a 
valid argument will be such that the value of the premises taken jointly will always be less than or equal to the 
conclusion. 

For example, the preserved property could be justification, the foundational concept of intuitionistic logic. Thus, a 
proposition is not true or false; instead, it is justified or flawed. A key difference between justification and truth, in 
this case, is that the law of excluded middle doesn't hold: a proposition that is not flawed is not necessarily justified; 
instead, it's only not proven that it's flawed. The key difference is the determinacy of the preserved property: One 
may prove that P is justified, that P is flawed, or be unable to prove either. A valid argument preserves justification 
across transformations, so a proposition derived from justified propositions is still justified. However, there are 
proofs in classical logic that depend upon the law of excluded middle; since that law is not usable under this scheme, 
there are propositions that cannot be proven that way. 

Relation to fuzzy logic 

Multi- valued logic is strictly related with fuzzy set theory and fuzzy logic. The notion of fuzzy subset was introduced 
by Lotfi Zadeh as a formalization of vagueness; i.e., the phenomenon that a predicate may apply to an object not 
absolutely, but to a certain degree, and that there may be borderline cases. Indeed, as in multi-valued logic, fuzzy 
logic admits truth values different from "true" and "false". As an example, usually the set of possible truth values is 
the whole interval [0,1]. Nevertheless, the main difference between fuzzy logic and multi- valued logic is in the aims. 
In fact, in spite of its philosophical interest (it can be used to deal with the Sorites paradox), fuzzy logic is devoted 
mainly to the applications. More precisely, there are two approaches to fuzzy logic. The first one is very closely 
linked with multi- valued logic tradition (Hajek school). So a set of designed values is fixed and this enables us to 
define an entailment relation. The deduction apparatus is defined by a suitable set of logical axioms and suitable 
inference rules. Another approach (Goguen, Pavelka and others) is devoted to defining a deduction apparatus in 
which approximate reasonings are admitted. Such an apparatus is defined by a suitable fuzzy subset of logical 
axioms and by a suitable set of fuzzy inference rules. In the first case the logical consequence operator gives the set 
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of logical consequence of a given set of axioms. In the latter the logical consequence operator gives the fuzzy subset 
of logical consequence of a given fuzzy subset of hypotheses. 

Another example of an infinitely- valued logic is probability logic. 

History 

The first known classical logician who didn't fully accept the law of excluded middle was Aristotle (who, ironically, 
is also generally considered to be the first classical logician and the "father of logic" ^ ), who admitted that his laws 
did not all apply to future events (De Interpretation , ch. IX). But he didn't create a system of multi-valued logic to 
explain this isolated remark. The later logicians until the coming of the 20th century followed Aristotelian logic, 
which includes or assumes the law of the excluded middle. 

The 20th century brought the idea of multi- valued logic back. The Polish logician and philosopher Jan Lukasiewicz 
began to create systems of many-valued logic in 1920, using a third value "possible" to deal with Aristotle's paradox 
of the sea battle. Meanwhile, the American mathematician Emil L. Post (1921) also introduced the formulation of 
additional truth degrees with n > 2,where n are the truth values. Later Jan Lukasiewicz and Alfred Tarski together 
formulated a logic on n truth values where n > 2 and in 1932 Hans Reichenbach formulated a logic of many truth 
values where ^ ^infinity. Kurt Godel in 1932 showed that intuitionistic logic is not a finitely-many valued logic, and 
defined a system of Godel logics intermediate between classical and intuitionistic logic; such logics are known as 
intermediate logics. 

References 

• Beziau J.-Y. 1997 What is many- valued logic ? Proceedings of the 27th International Symposium on 
Multiple-Valued Logic, IEEE Computer Society, Los Alamitos, pp. 117-121. 

• Chang C.C. and Keisler H. J. 1966. Continuous Model Theory, Princeton, Princeton University Press. 

• Cignoli, R. L. O., D'Ottaviano, I, M. L., Mundici, D., 2000. Algebraic Foundations of Many-valued Reasoning. 
Kluwer. 

• Hajek P., 1998, Metamathematics of fuzzy logic. Kluwer. 

• Malinowski, Gregorz, 2001, Many -Valued Logics, in Goble, Lou, ed., The Blackwell Guide to Philosophical 
Logic. Blackwell. 

• Gerla G. 2001, Fuzzy logic: Mathematical Tools for Approximate Reasoning, Kluwer Academic Publishers, 
Dordrecht. 

• Goguen J. A. 1968/69, The logic of inexact concepts, Synthese, 19, 325-373. 

• S. Gottwald, A Treatise on Many-Valued Logics. Studies in Logic and Computation, vol. 9, Research Studies 
Press: Baldock, Hertfordshire, England, 2001. 

• Pavelka J. 1979, On fuzzy logic I: Many -valued rules of inference, Zeitschr. f. math. Logik und Grundlagen d. 
Math., 25, 45-52. 

• Prior A. 1957, Time and Modality. Oxford University Press, based on his 1956 John Locke lectures 



Multi- valued logic 



454 



External links 

• Stanford Encyclopedia of Philosophy: "Many-Valued Logic — by Siegfried Gottwald. 

Notes 

[1] Hurley, Patrick. A Concise Introduction to Logic, 9th edition. (2006). 
[2] http://plato.stanford.edu/entries/logic-manyvalued/ 

Quantum information 

In quantum mechanics, quantum information is physical information that is held in the "state" of a quantum 
system. The most popular unit of quantum information is the qubit, a two-level quantum system. However, unlike 
classical digital states (which are discrete), a two-state quantum system can actually be in a superposition of the two 
states at any given time. 

Quantum information differs from classical information in several respects, among which we note the following: 

• It cannot be read without the state becoming the measured value, 

• An arbitrary state cannot be cloned, 

• The state may be in a superposition of basis values. 

However, despite this, the amount of information that can be retrieved in a single qubit is equal to one bit. It is in the 
processing of information (quantum computation) that a difference occurs. 

The ability to manipulate quantum information enables us to perform tasks that would be unachievable in a classical 
context, such as unconditionally secure transmission of information. Quantum information processing is the most 
general field that is concerned with quantum information. There are certain tasks which classical computers cannot 
perform "efficiently" (that is, in polynomial time) according to any known algorithm. However, a quantum computer 
can compute the answer to some of these problems in polynomial time; one well-known example of this is Shor's 
factoring algorithm. Other algorithms can speed up a task less dramatically - for example, Grover's search algorithm 
which gives a quadratic speed-up over the best possible classical algorithm. 

Quantum information, and changes in quantum information, can be quantitatively measured by using an analogue of 
Shannon entropy, called the von Neumann Entropy. Given a statistical ensemble of quantum mechanical systems 
with the density matrix p , it is given by 

S{p) = -Tr(plnp). 

Many of the same entropy measures in classical information theory can also be generalized to the quantum case, 
such as Holevo entropy ^ and the conditional quantum entropy. 

Quantum information theory 

The theory of quantum information is a result of the effort to generalise classical information theory to the quantum 
world. Quantum information theory aims to answer the following question: 

What happens if information is stored in a state of a quantum system? 

One of the strengths of classical information theory is that physical representation of information can be disregarded: 
There is no need for an 'ink-on-paper' information theory or a 'DVD information' theory. This is because it is always 
possible to efficiently transform information from one representation to another. However, this is not the case for 
quantum information: it is not possible, for example, to write down on paper the previously unknown information 
contained in the polarisation of a photon. 



Quantum information 



455 



In general, quantum mechanics does not allow us to read out the state of a quantum system with arbitrary precision. 
The existence of Bell correlations between quantum systems cannot be converted into classical information. It is 
only possible to transform quantum information between quantum systems of sufficient information capacity. The 
information content of a message J\/[ can, for this reason, be measured in terms of the minimum number n of 
two-level systems which are needed to store the message: J\/[ consists of n qubits. In its original theoretical sense, 
the term qubit is thus a measure for the amount of information. A two-level quantum system can carry at most one 
qubit, in the same sense a classical binary digit can carry at most one classical bit. 

As a consequence of the noisy-channel coding theorem, noise limits the information content of an analog 
information carrier to be finite. It is very difficult to protect the remaining finite information content of analog 
information carriers against noise. The example of classical analog information shows that quantum information 
processing schemes must necessarily be tolerant against noise, otherwise there would not be a chance for them to be 
useful. It was a big breakthrough for the theory of quantum information, when quantum error correction codes and 
fault-tolerant quantum computation schemes were discovered. 



External links and references 

Lectures at the Institut Henri Poincare (slides and videos) 
Quantum Information Theory at ETH Zurich L J 
Quantum Information ^ Perimeter Institute for Theoretical Physics 

Center for Quantum Computation ^ - The CQC, part of Cambridge University, is a group of researchers studying 
quantum information, and is a useful portal for those interested in this field. 

Quantum Information Group ^ The quantum information research group at the University of Nottingham. 
Qwiki - A quantum physics wiki devoted to providing technical resources for practicing quantum information 
scientists. 

Quantiki ^ - A wiki portal for quantum information with introductory tutorials. 

Charles H. Bennett and Peter W. Shor, "Quantum Information Theory," IEEE Transactions on Information 
Theory, Vol 44, pp 2724-2742, Oct 1998 

rm 

Institute for Quantum Computing - The Institute for Quantum Computing, based in Waterloo, ON Canada, is a 
research institute working in conjunction with the University of Waterloo ^ and Perimeter Institute on the 
subject of Quantum Information. 

ri2i 

Quantum information can be negative 

Gregg Jaeger's book on Quantum Information [13] (published by Springer, New York, 2007, ISBN 0-387-35725-4) 

The International Conference on Quantum Information (ICQI) ^ 

Institute of Quantum Information ^ Caltech 

Quantum Information Theory ^ Imperial College 

ri7i 

Quantum Information Technology Toshiba Research 

International Journal of Quantum Information ^ World Scientific 

ri9i 

USC Center for Quantum Information Science & Technology 
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Quantum logic 

In quantum mechanics, quantum logic is a set of rules for reasoning about propositions which takes the principles of 
quantum theory into account. This research area and its name originated in the 1936 paper by Garrett Birkhoff and 
John von Neumann, who were attempting to reconcile the apparent inconsistency of classical boolean logic with the 
facts concerning the measurement of complementary variables in quantum mechanics, such as position and 
momentum. 

Quantum logic can be formulated either as a modified version of propositional logic or as a noncommutative and 
non-associative many- valued (MV) logic ^ ^ ^ ^ ^ . 

Quantum logic has some properties which clearly distinguish it from classical logic, most notably, the failure of the 
distributive law of propositional logic: 

p and (q or r) = (p and q) or (p and r), 

where the symbols p, q and r are propositional variables. To illustrate why the distributive law fails, consider a 
particle moving on a line and let 

p = "the particle is moving to the right" 

q = "the particle is in the interval [-1,1]" 

r = "the particle is not in the interval [-1,1]" 

then the proposition "q or r" is true, so 

p and (q or r) = p 

On the other hand, the propositions "/? and q" and "/? and r" are both false, since they assert tighter restrictions on 
simultaneous values of position and momentum than is allowed by the uncertainty principle. So, 

(p and q) or (p and r) = false 

Thus the distributive law fails. 

Quantum logic has been proposed as the correct logic for propositional inference generally, most notably by the 
philosopher Hilary Putnam, at least at one point in his career. This thesis was an important ingredient in Putnam's 
paper Is Logic Empirical? in which he analysed the epistemological status of the rules of propositional logic. Putnam 
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attributes the idea that anomalies associated to quantum measurements originate with anomalies in the logic of 
physics itself to the physicist David Finkelstein. However, this idea had been around for some time and had been 
revived several years earlier by George Mackey's work on group representations and symmetry. 

The more common view regarding quantum logic, however, is that it provides a formalism for relating observables, 
system preparation filters and states. In this view, the quantum logic approach resembles more closely the 
C*-algebraic approach to quantum mechanics; in fact with some minor technical assumptions it can be subsumed by 
it. The similarities of the quantum logic formalism to a system of deductive logic may then be regarded more as a 
curiosity than as a fact of fundamental philosophical importance. A more modern approach to the structure of 
quantum logic is to assume that it is a diagram - in the sense of category theory - of classical logics (see David 
Edwards). 

Introduction 

In his classic treatise Mathematical Foundations of Quantum Mechanics, John von Neumann noted that projections 
on a Hilbert space can be viewed as propositions about physical observables. The set of principles for manipulating 
these quantum propositions was called quantum logic by von Neumann and Birkhoff. In his book (also called 
Mathematical Foundations of Quantum Mechanics) G. Mackey attempted to provide a set of axioms for this 
propositional system as an orthocomplemented lattice. Mackey viewed elements of this set as potential yes or no 
questions an observer might ask about the state of a physical system, questions that would be settled by some 
measurement. Moreover Mackey defined a physical observable in terms of these basic questions. Mackey's axiom 
system is somewhat unsatisfactory though, since it assumes that the partially ordered set is actually given as the 
orthocomplemented closed subspace lattice of a separable Hilbert space. Piron, Ludwig and others have attempted to 
give axiomatizations which do not require such explicit relations to the lattice of subspaces. 

The remainder of this article assumes the reader is familiar with the spectral theory of self-adjoint operators on a 
Hilbert space. However, the main ideas can be understood using the finite-dimensional spectral theorem. 

Projections as propositions 

The so-called Hamiltonian formulations of classical mechanics have three ingredients: states, observables and 

3 6 
dynamics. In the simplest case of a single particle moving in R , the state space is the position-momentum space R . 

We will merely note here that an observable is some real-valued function / on the state space. Examples of 

observables are position, momentum or energy of a particle. For classical systems, the value fix), that is the value off 

for some particular system state x, is obtained by a process of measurement of /. The propositions concerning a 

classical system are generated from basic statements of the form 

• Measurement off yields a value in the interval [a, b] for some real numbers a, b. 

It follows easily from this characterization of propositions in classical systems that the corresponding logic is 
identical to that of some Boolean algebra of subsets of the state space. By logic in this context we mean the rules that 
relate set operations and ordering relations, such as de Morgan's laws. These are analogous to the rules relating 
boolean conjunctives and material implication in classical propositional logic. For technical reasons, we will also 
assume that the algebra of subsets of the state space is that of all Borel sets. The set of propositions is ordered by the 
natural ordering of sets and has a complementation operation. In terms of observables, the complement of the 
proposition {f> a} is {f< a}. 

We summarize these remarks as follows: 

• The proposition system of a classical system is a lattice with a distinguished orthocomplementation operation: 
The lattice operations of meet and join are respectively set intersection and set union. The orthocomplementation 
operation is set complement. Moreover this lattice is sequentially complete, in the sense that any sequence {E} of 
elements of the lattice has a least upper bound, specifically the set-theoretic union: 
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LVB({E i }) = (jE i . 

In the Hilbert space formulation of quantum mechanics as presented by von Neumann, a physical observable is 
represented by some (possibly unbounded) densely-defined self-adjoint operator A on a Hilbert space H. A has a 
spectral decomposition, which is a projection- valued measure E defined on the Borel subsets of R. In particular, for 
any bounded Borel function/, the following equation holds: 

f(A)= f /(A)rfE(A). 

JR 

In case f is the indicator function of an interval [a, b], the operator /(A) is a self-adjoint projection, and can be 
interpreted as the quantum analogue of the classical proposition 

• Measurement of A yields a value in the interval [a, b]. 

The propositional lattice of a quantum mechanical system 

This suggests the following quantum mechanical replacement for the orthocomplemented lattice of propositions in 
classical mechanics. This is essentially Mackey's Axiom VII: 

• The orthocomplemented lattice Q of propositions of a quantum mechanical system is the lattice of closed 
subspaces of a complex Hilbert space H where orthocomplementation of V is the orthogonal complement V 1 . 

Q is also sequentially complete: any pairwise disjoint sequence! V.} . of elements of Q has a least upper bound. Here 

i i 

disjointness of W 1 and W 2 means W 2 is a subspace of . The least upper bound of { V.}. is the closed internal direct 
sum. 

Henceforth we identify elements of Q with self-adjoint projections on the Hilbert space H. 

The structure of Q immediately points to a difference with the partial order structure of a classical proposition 
system. In the classical case, given a proposition p, the equations 

I = pVq 

0 = p A q 

have exactly one solution, namely the set-theoretic complement of p. In these equations / refers to the atomic 
proposition which is identically true and 0 the atomic proposition which is identically false. In the case of the lattice 
of projections there are infinitely many solutions to the above equations. 

Having made these preliminary remarks, we turn everything around and attempt to define observables within the 
projection lattice framework and using this definition establish the correspondence between self-adjoint operators 
and observables: A Mackey observable is a countably additive homomorphism from the orthocomplemented lattice 
of the Borel subsets of R to Q. To say the mapping cp is a countably additive homomorphism means that for any 
sequence {S.}. of pairwise disjoint Borel subsets of R, {cp(S.)} . are pairwise orthogonal projections and 

(oo \ oo 
Us) 
i=l / i=l 

Theorem. There is a bijective correspondence between Mackey observables and densely-defined self-adjoint 
operators on H. 

This is the content of the spectral theorem as stated in terms of spectral measures. 
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Statistical structure 

Imagine a forensics lab which has some apparatus to measure the speed of a bullet fired from a gun. Under carefully 
controlled conditions of temperature, humidity, pressure and so on the same gun is fired repeatedly and speed 
measurements taken. This produces some distribution of speeds. Though we will not get exactly the same value for 
each individual measurement, for each cluster of measurements, we would expect the experiment to lead to the same 
distribution of speeds. In particular, we can expect to assign probability distributions to propositions such as {a < 
speed < b}. This leads naturally to propose that under controlled conditions of preparation, the measurement of a 
classical system can be described by a probability measure on the state space. This same statistical structure is also 
present in quantum mechanics. 

A quantum probability measure is a function P defined on Q with values in [0,1] such that P(0)=0, P(I)=1 and if 
{E.}.is a sequence of pairwise orthogonal elements of Q then 

(oo \ oo 
i=l / 1=1 

The following highly non-trivial theorem is due to Andrew Gleason: 

Theorem. Suppose H is a separable Hilbert space of complex dimension at least 3. Then for any quantum 
probability measure on Q there exists a unique trace class operator S such that 

P(E) = Ti(SE) 

for any self-adjoint projection E. 

The operator S is necessarily non-negative (that is all eigenvalues are non-negative) and of trace 1. Such an operator 
is often called a density operator. 

Physicists commonly regard a density operator as being represented by a (possibly infinite) density matrix relative to 
some orthonormal basis. 

For more information on statistics of quantum systems, see quantum statistical mechanics. 

Automorphisms 

An automorphism of Q is a bijective mapping a:Q —> Q which preserves the orthocomplemented structure of Q, that 
is 

(oo \ oo 
i=l / i=l 

for any sequence {£.} . of pairwise orthogonal self-adjoint projections. Note that this property implies monotonicity 
of a. If P is a quantum probability measure on Q, then E —> a(E) is also a quantum probability measure on Q. By the 
Gleason theorem characterizing quantum probability measures quoted above, any automorphism a induces a 
mapping a* on the density operators by the following formula: 

Ti(a*(S)E) =Tr(Sa(E)). 

The mapping a* is bijective and preserves convex combinations of density operators. This means 

a*(nSi + r 2 S 2 ) = na*(5i) + r 2 a*(S 2 ) 

whenever 1 = + r^ and r^ r^ are non-negative real numbers. Now we use a theorem of Richard V. Kadison: 

Theorem. Suppose (3 is a bijective map from density operators to density operators which is convexity preserving. 
Then there is an operator U on the Hilbert space which is either linear or conjugate-linear, preserves the inner 
product and is such that 

f3{S) = usu* 

for every density operator S. In the first case we say U is unitary, in the second case U is anti-unitary. 
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Remark. This note is included for technical accuracy only, and should not concern most readers. The 
result quoted above is not directly stated in Kadison's paper, but can be reduced to it by noting first that 
(3 extends to a positive trace preserving map on the trace class operators, then applying duality and 
finally applying a result of Kadison's paper. 

The operator U is not quite unique; if r is a complex scalar of modulus 1, then r U will be unitary or anti-unitary if U 
is and will implement the same automorphism. In fact, this is the only ambiguity possible. 

It follows that automorphisms of Q are in bijective correspondence to unitary or anti-unitary operators modulo 
multiplication by scalars of modulus 1. Moreover, we can regard automorphisms in two equivalent ways: as 
operating on states (represented as density operators) or as operating on Q. 

Non-relativistic dynamics 

In non-relativistic physical systems, there is no ambiguity in referring to time evolution since there is a global time 
parameter. Moreover an isolated quantum system evolves in a deterministic way: if the system is in a state S at time t 
then at time s > t, the system is in a state F^ £S). Moreover, we assume 

• The dependence is reversible: The operators F^ ^ are bijective. 

• The dependence is homogeneous: F = F _ . 

• The dependence is convexity preserving: That is, each F^ ^(5) is convexity preserving. 

• The dependence is weakly continuous: The mapping R given by t —> Tr(F^ ^(5) E) is continuous for every E 
in Q. 

By Kadison's theorem, there is a 1 -parameter family of unitary or anti-unitary operators {U} such that 

F M (5) = u s _ t su;_ t 

In fact, 

Theorem. Under the above assumptions, there is a strongly continuous 1 -parameter group of unitary operators {U } 
such that the above equation holds. 

Note that it easily from uniqueness from Kadison's theorem that 
U t+s = a{t,s)U t U s 

where a(t,s) has modulus 1. Now the square of an anti-unitary is a unitary, so that all the U are unitary. The 
remainder of the argument shows that a(t,s) can be chosen to be 1 (by modifying each U by a scalar of modulus 1.) 

Pure states 

A convex combination of statistical states and is a state of the form S = p l +p 2 where P 2 are 
non-negative and p + p =1. Considering the statistical state of system as specified by lab conditions used for its 
preparation, the convex combination S can be regarded as the state formed in the following way: toss a biased coin 
with outcome probabilities p , p 2 and depending on outcome choose system prepared to S or S 2 

Density operators form a convex set. The convex set of density operators has extreme points; these are the density 
operators given by a projection onto a one-dimensional space. To see that any extreme point is such a projection, 
note that by the spectral theorem S can be represented by a diagonal matrix; since S is non-negative all the entries are 
non-negative and since S has trace 1, the diagonal entries must add up to 1. Now if it happens that the diagonal 
matrix has more than one non-zero entry it is clear that we can express it as a convex combination of other density 
operators. 

The extreme points of the set of density operators are called pure states. If S is the projection on the 1 -dimensional 
space generated by a vector ip of norm 1 then 

Tt(SE) = (Efl>\lf>) 
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for any E in Q. In physics jargon, if 

s = |v) (VI, 

where has norm 1, then 

Tv(SE) = <V|£|V). 

Thus pure states can be identified with rays in the Hilbert space H. 



The measurement process 

Consider a quantum mechanical system with lattice Q which is in some statistical state given by a density operator S. 
This essentially means an ensemble of systems specified by a repeatable lab preparation process. The result of a 
cluster of measurements intended to determine the truth value of proposition E, is just as in the classical case, a 
probability distribution of truth values T and F. Say the probabilities are p for T and q = 1 - p for F. By the previous 
section p = Tv(S E) and q = Tr(S (I - E)). 

Perhaps the most fundamental difference between classical and quantum systems is the following: regardless of what 
process is used to determine E immediately after the measurement the system will be in one of two statistical states: 

• If the result of the measurement is T 

} - ESE. 
Tt(ES) 

• If the result of the measurement is F 

■(/ - E)S{I - E). 



Tr(( J - E)S) 

(We leave to the reader the handling of the degenerate cases in which the denominators may be 0.) We now form the 
convex combination of these two ensembles using the relative frequencies p and q. We thus obtain the result that the 
measurement process applied to a statistical ensemble in state S yields another ensemble in statistical state: 

M E (S) = ESE + (I - E)S(I - E). 

We see that a pure ensemble becomes a mixed ensemble after measurement. Measurement, as described above, is a 
special case of quantum operations. 



Limitations 

Quantum logic derived from propositional logic provides a satisfactory foundation for a theory of reversible quantum 
processes. Examples of such processes are the covariance transformations relating two frames of reference, such as 
change of time parameter or the transformations of special relativity. Quantum logic also provides a satisfactory 
understanding of density matrices. Quantum logic can be stretched to account for some kinds of measurement 
processes corresponding to answering yes-no questions about the state of a quantum system. However, for more 
general kinds of measurement operations (that is quantum operations), a more complete theory of filtering processes 
is necessary. Such an approach is provided by the consistent histories formalism. On the other hand, quantum logics 
derived from MV-logic extend its range of applicability to irreversible quantum processes and/or 'open' quantum 
systems. 

In any case, these quantum logic formalisms must be generalized in order to deal with super-geometry (which is 
needed to handle Fermi-fields) and non-commutative geometry (which is needed in string theory and quantum 
gravity theory). Both of these theories use a partial algebra with an "integral" or "trace". The elements of the partial 
algebra are not observables; instead the "trace" yields "greens functions" which generate scattering amplitudes. One 
thus obtains a local S-matrix theory (see D. Edwards). 

Since around 1978 the Flato school (see F. Bay en) has been developing an alternative to the quantum logics 
approach called deformation quantization (see Weyl quantization). 
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In 2004, Prakash Panangaden described how to capture the kinematics of quantum causal evolution using System 
BV, a deep inference logic originally developed for use in structural proof theory. 1 ^ Alessio Guglielmi, Lutz 
StraBburger, and Richard Blute have also done work in this area. 
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Quantum computer 



A quantum computer is a device for computation that makes direct 
use of quantum mechanical phenomena, such as superposition and 
entanglement, to perform operations on data. Quantum computers are 
different from traditional computers based on transistors. The basic 
principle behind quantum computation is that quantum properties can 
be used to represent data and perform operations on these data J ^ A 
theoretical model is the quantum Turing machine, also known as the 
universal quantum computer. 

Although quantum computing is still in its infancy, experiments have 
been carried out in which quantum computational operations were 
executed on a very small number of qubits (quantum bit). Both 
practical and theoretical research continues, and many national 
government and military funding agencies support quantum computing 
research to develop quantum computers for both civilian and national 
security purposes, such as cryptanalysis. 

If large-scale quantum computers can be built, they will be able to solve certain problems much faster than any 
current classical computers (for example Shor's algorithm). Quantum computers do not allow the computation of 
functions that are not theoretically computable by classical computers, i.e. they do not alter the Church-Turing 
thesis. The gain is only in efficiency. 

Basis 

A classical computer has a memory made up of bits, where each bit represents either a one or a zero. A quantum 
computer maintains a sequence of qubits. A single qubit can represent a one, a zero, or, crucially, any quantum 
superposition of these; moreover, a pair of qubits can be in any quantum superposition of 4 states, and three qubits in 
any superposition of 8. In general a quantum computer with n qubits can be in an arbitrary superposition of up to 
2 71 different states simultaneously (this compares to a normal computer that can only be in one of these 2 n states at 
any one time). A quantum computer operates by manipulating those qubits with a fixed sequence of quantum logic 
gates. The sequence of gates to be applied is called a quantum algorithm. 

An example of an implementation of qubits for a quantum computer could start with the use of particles with two 
spin states: "down" and "up" (typically written ||) and ||) , or |0) and |1) ). But in fact any system possessing 
an observable quantity A which is conserved under time evolution and such that A has at least two discrete and 
sufficiently spaced consecutive eigenvalues, is a suitable candidate for implementing a qubit. This is true because 
any such system can be mapped onto an effective spin- 1/2 system. 





The Bloch sphere is a representation of a qubit, 
the fundamental building block of quantum 
computers. 



Quantum computer 



464 



Bits vs. qubits 



Consider first a classical computer that operates on a three-bit 
register. The state of the computer at any time is a probability 
distribution over the 2 3 = 8 different three-bit strings 000 , 
001, 010, 011, 100, 101, 110, 111. If it is a 
deterministic computer, then it is in exactly one of these states 
with probability 1. However, if it is a probabilistic computer, then 
there is a possibility of it being in any one of a number of different 
states. We can describe this probabilistic state by eight 
nonnegative numbers a,b,c,d,ef,g,h (where a = probability 
computer is in state 0 0 0,/? = probability computer is in state 001, 
etc.). There is a restriction that these probabilities sum to 1. 



HO) 



«* |0101) ~ |5) 



~ |4) + |5) 



qubits can be in a superposition of all the 
clasically allowed states 

Qubits are made up of controlled particles and the 

means of control (e.g. devices that trap particles and 

[3] 

switch them from one state to another). 



The state of a three-qubit quantum computer is similarly described by an eight-dimensional vector (a,b,c,d,ef,g,h), 
called a ket. However, instead of adding to one, the sum of the squares of the coefficient magnitudes, 
\a\ 2 + |6| 2 + ... + \h\ 2 , must equal one. Moreover, the coefficients are complex numbers. Since states are 
represented by complex wavef unctions, two states being added together will undergo interference. This is a key 
difference between quantum computing and probabilistic classical computing J 4 ^ 

If you measure the three qubits, then you will observe a three-bit string. The probability of measuring a string will 
equal the squared magnitude of that string's coefficients (using our example, probability that we read state as 0 0 0 = 
|a| 2 , probability that we read state as 001 = |fc| 2 , etc.). Thus a measurement of the quantum state with 

coefficients (a,b,...,h) gives the classical probability distribution (|a| 2 , |b| 2 , \h\ 2 )- We say that the quantum 

state "collapses" to a classical state. 

Note that an eight-dimensional vector can be specified in many different ways, depending on what basis you choose 
for the space. The basis of three-bit strings 000, 001, Ill is known as the computational basis, and is often 
convenient, but other bases of unit-length, orthogonal vectors can also be used. Ket notation is often used to make 
explicit the choice of basis. For example, the state (a,b,c,d,ef,g,h) in the computational basis can be written as 

a |000) + b |001) + c |010) + d |011) + e |100) + / |101) + g |110) + h |111) , where, e.g., 
1 010) =(0,0,1,0,0,0,0,0). 

The computational basis for a single qubit (two dimensions) is |0) = (1,0), |1) = (0,1), but another common basis 
are the eigenvectors of the Pauli-x operator: |+) = ^ (1, l)and |— ) = (1, —1). 

Note that although recording a classical state of n bits, a 2^-dimensional probability distribution, requires an 
exponential number of real numbers, practically we can always think of the system as being exactly one of the n-b\i 
strings — we just don't know which one. Quantum mechanically, this is not the case, and all 2 n complex coefficients 

need to be kept track of to see how the quantum system evolves. For example, a 300-qubit quantum computer has a 

300 90 

state described by 2 (approximately 10 ) complex numbers, more than the number of atoms in the observable 
universe. 
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Operation 

While a classical three-bit state and a quantum three-qubit state are both eight-dimensional vectors, they are 
manipulated quite differently for classical or quantum computation. For computing in either case, the system must be 
initialized, for example into the all-zeros string, |000) , corresponding to the vector (1,0,0,0,0,0,0,0). In classical 
randomized computation, the system evolves according to the application of stochastic matrices, which preserve that 
the probabilities add up to one (i.e., preserve the LI norm). In quantum computation, on the other hand, allowed 
operations are unitary matrices, which are effectively rotations (they preserve that the sum of the squares add up to 
one, the Euclidean or L2 norm). (Exactly what unitaries can be applied depend on the physics of the quantum 
device.) Consequently, since rotations can be undone by rotating backward, quantum computations are reversible. 
(Technically, quantum operations can be probabilistic combinations of unitaries, so quantum computation really does 
generalize classical computation. See quantum circuit for a more precise formulation.) 

Finally, upon termination of the algorithm, the result needs to be read off. In the case of a classical computer, we 
sample from the probability distribution on the three-bit register to obtain one definite three-bit string, say 000. 
Quantum mechanically, we measure the three-qubit state, which is equivalent to collapsing the quantum state down 
to a classical distribution (with the coefficients in the classical state being the squared magnitudes of the coefficients 
for the quantum state, as described above) followed by sampling from that distribution. Note that this destroys the 
original quantum state. Many algorithms will only give the correct answer with a certain probability, however by 
repeatedly initializing, running and measuring the quantum computer, the probability of getting the correct answer 
can be increased. 

For more details on the sequences of operations used for various quantum algorithms, see universal quantum 
computer, Shor's algorithm, Grover's algorithm, Deutsch-Jozsa algorithm, amplitude amplification, quantum Fourier 
transform, quantum gate, quantum adiabatic algorithm and quantum error correction. 



Potential 

Integer factorization is believed to be computationally infeasible with an ordinary computer for large integers if they 
are the product of few prime numbers (e.g., products of two 300-digit primes) By comparison, a quantum 
computer could efficiently solve this problem using Shor's algorithm to find its factors. This ability would allow a 
quantum computer to decrypt many of the cryptographic systems in use today, in the sense that there would be a 
polynomial time (in the number of digits of the integer) algorithm for solving the problem. In particular, most of the 
popular public key ciphers are based on the difficulty of factoring integers (or the related discrete logarithm problem 
which can also be solved by Shor's algorithm), including forms of RSA. These are used to protect secure Web pages, 
encrypted email, and many other types of data. Breaking these would have significant ramifications for electronic 
privacy and security. 

However, other existing cryptographic algorithms don't appear to be broken by these algorithms. ^ ^ Some 
public-key algorithms are based on problems other than the integer factorization and discrete logarithm problems to 
which Shor's algorithm applies, like the McEliece cryptosystem based on a problem in coding theory. ^ ^ Lattice 
based cryptosy stems are also not known to be broken by quantum computers, and finding a polynomial time 
algorithm for solving the dihedral hidden subgroup problem, which would break many lattice based crypto systems, 

rm 

is a well-studied open problem. It has been proven that applying Grover's algorithm to break a symmetric (secret 
key) algorithm by brute force requires roughly 2 n/2 invocations of the underlying cryptographic algorithm, compared 
with roughly 2 n in the classical case/ 10 ^ meaning that symmetric key lengths are effectively halved: AES-256 would 
have the same security against an attack using Grover's algorithm that AES-128 has against classical brute-force 
search (see Key size). Quantum cryptography could potentially fulfill some of the functions of public key 
cryptography. 
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Besides factorization and discrete logarithms, quantum algorithms offering a more than polynomial speedup over the 
best known classical algorithm have been found for several problems, 11 1] including the simulation of quantum 
physical processes from chemistry and solid state physics, the approximation of Jones polynomials, and solving 
Pell's equation. No mathematical proof has been found that shows that an equally fast classical algorithm cannot be 
discovered, although this is considered unlikely. For some problems, quantum computers offer a polynomial 
speedup. The most well-known example of this is quantum database search, which can be solved by Grover's 
algorithm using quadratically fewer queries to the database than are required by classical algorithms. In this case the 
advantage is provable. Several other examples of provable quantum speedups for query problems have subsequently 
been discovered, such as for finding collisions in two-to-one functions and evaluating NAND trees. 

Consider a problem that has these four properties: 

1 . The only way to solve it is to guess answers repeatedly and check them, 

2. There are n possible answers to check, 

3. Every possible answer takes the same amount of time to check, and 

4. There are no clues about which answers might be better: generating possibilities randomly is just as good as 
checking them in some special order. 

An example of this is a password cracker that attempts to guess the password for an encrypted file (assuming that the 
password has a maximum possible length). 

For problems with all four properties, the time for a quantum computer to solve this will be proportional to the 
square root of n. That can be a very large speedup, reducing some problems from years to seconds. It can be used to 
attack symmetric ciphers such as Triple DES and AES by attempting to guess the secret key. 

Grover's algorithm can also be used to obtain a quadratic speed-up [over a brute-force search] for a class of problems 
known as NP-complete. 

Since chemistry and nanotechnology rely on understanding quantum systems, and such systems are impossible to 
simulate in an efficient manner classically, many believe quantum simulation will be one of the most important 
applications of quantum computing.^ 

There are a number of practical difficulties in building a quantum computer, and thus far quantum computers have 

only solved trivial problems. David DiVincenzo, of IBM, listed the following requirements for a practical quantum 

, [13] 
computer: 

• scalable physically to increase the number of qubits; 

• qubits can be initialized to arbitrary values; 

• quantum gates faster than decoherence time; 

• universal gate set; 

• qubits can be read easily. 

Quantum decoherence 

One of the greatest challenges is controlling or removing quantum decoherence. This usually means isolating the 
system from its environment as the slightest interaction with the external world would cause the system to decohere. 
This effect is irreversible, as it is non-unitary, and is usually something that should be highly controlled, if not 
avoided. Decoherence times for candidate systems, in particular the transverse relaxation time T (for NMR and MRI 
technology, also called the dephasing time), typically range between nanoseconds and seconds at low temperature. 

These issues are more difficult for optical approaches as the timescales are orders of magnitude shorter and an 
often-cited approach to overcoming them is optical pulse shaping. Error rates are typically proportional to the ratio 
of operating time to decoherence time, hence any operation must be completed much more quickly than the 
decoherence time. 
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If the error rate is small enough, it is thought to be possible to use quantum error correction, which corrects errors 
due to decoherence, thereby allowing the total calculation time to be longer than the decoherence time. An often 

-4 

cited figure for required error rate in each gate is 10 . This implies that each gate must be able to perform its task in 
one 10,000th of the decoherence time of the system. 

Meeting this scalability condition is possible for a wide range of systems. However, the use of error correction brings 

with it the cost of a greatly increased number of required qubits. The number required to factor integers using Shor's 

2 

algorithm is still polynomial, and thought to be between L and L , where L is the number of bits in the number to be 
factored; error correction algorithms would inflate this figure by an additional factor of L. For a 1000-bit number, 
this implies a need for about 10 4 qubits without error correction. 11 ^ With error correction, the figure would rise to 
about 10 qubits. Note that computation time is about £ 2 or about \Q 7 steps and on 1 MHz, about 10 seconds. 

A very different approach to the stability-decoherence problem is to create a topological quantum computer with 
anyons, quasi-particles used as threads and relying on braid theory to form stable logic gates. tl5] tl6] 



Developments 

There are a number of quantum computing candidates, among those: 

ri7i 

Superconductor-based quantum computers (including SQUID-based quantum computers) 
Trapped ion quantum computer 
Optical lattices 

Topological quantum computer ^ 18] 

Quantum dot on surface (e.g. the Loss-DiVincenzo quantum computer) 
Nuclear magnetic resonance on molecules in solution (liquid NMR) 
Solid state NMR Kane quantum computers 
Electrons on helium quantum computers 
Cavity quantum electrodynamics (CQED) 
Molecular magnet 

Fullerene-based ESR quantum computer 
Optic-based quantum computers (Quantum optics) 
Diamond-based quantum computer^ ^ ^ 
Bose-Einstein condensate-based quantum computer 1 

Transistor-based quantum computer - string quantum computers with entrainment of positive holes using an 
electrostatic trap 
Spin-based quantum computer 
Adiabatic quantum computation 1 

Rare-earth-metal-ion-doped inorganic crystal based quantum computers ^ ^ 

The large number of candidates demonstrates that the topic, in spite of rapid progress, is still in its infancy. But at the 
same time there is also a vast amount of flexibility. 

In 2005, researchers at the University of Michigan built a semiconductor chip which functioned as an ion trap. Such 
devices, produced by standard lithography techniques, may point the way to scalable quantum computing tools P 6 ^ 
An improved version was made in 2006. 

In 2009, researchers at Yale University created the first rudimentary solid-state quantum processor. The two-qubit 
superconducting chip was able to run elementary algorithms. Each of the two artificial atoms (or qubits) were made 
up of a billion aluminum atoms but they acted like a single one that could occupy two different energy states P 1 ^ ^ 

Another team, working at the University of Bristol, also created a silicon-based quantum computing chip, based on 
quantum optics. The team was able to run Shor's algorithm on the chip. 1 The latest developments [for 2010] can be 
found in ^ . 
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Relation to computational complexity theory 



PSPACE problems 




The suspected relationship of BQP to other 
problem spaces. 



The class of problems that can be efficiently solved by quantum 

computers is called BQP, for "bounded error, quantum, polynomial 

time". Quantum computers only run probabilistic algorithms, so BQP 

on quantum computers is the counterpart of BPP ("bounded error, 

probabilistic, polynomial time") on classical computers. It is defined as 

the set of problems solvable with a polynomial-time algorithm, whose 

T321 

probability of error is bounded away from one half. A quantum 
computer is said to "solve" a problem if, for every instance, its answer 
will be right with high probability. If that solution runs in polynomial 
time, then that problem is in BQP. 

BQP is contained in the complexity class #P (or more precisely in the 

#P T331 

associated class of decision problems P ), L which is a subclass of 
PSPACE. 

BQP is suspected to be disjoint from NP-complete and a strict superset of P, but that is not known. Both integer 
factorization and discrete log are in BQP. Both of these problems are NP problems suspected to be outside BPP, and 
hence outside P. Both are suspected to not be NP-complete. There is a common misconception that quantum 
computers can solve NP-complete problems in polynomial time. That is not known to be true, and is generally 
suspected to be false P 3 ^ 

Although quantum computers may be faster than classical computers, those described above can't solve any 
problems that classical computers can't solve, given enough time and memory (however, those amounts might be 
practically infeasible). A Turing machine can simulate these quantum computers, so such a quantum computer could 
never solve an undecidable problem like the halting problem. The existence of "standard" quantum computers does 
not disprove the Church— Turing thesis P^ It has been speculated that theories of quantum gravity, such as M-theory 
or loop quantum gravity, may allow even faster computers to be built. Currently, it's an open problem to even define 
computation in such theories due to the problem of time, i.e. there's no obvious way to describe what it means for an 
observer to submit input to a computer and later receive output. 



[35] 
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Algebraic Logic 



Boolean logic 

Boolean logic is a complete system for logical operations, used in many systems. It was named after George Boole, 
who first defined an algebraic system of logic in the mid 19th century. Boolean logic has many applications in 
electronics, computer hardware and software, and is the basis of all modern digital electronics. In 1938, Claude 
Shannon showed how electric circuits with relays could be modeled with Boolean logic. This fact soon proved 
enormously consequential with the emergence of the electronic computer. 

Using the algebra of sets, this article contains a basic introduction to sets, Boolean operations, Venn diagrams, truth 
tables, and Boolean applications. The Boolean algebra (structure) article discusses a type of algebraic structure that 
satisfies the axioms of Boolean logic. The binary arithmetic article discusses the use of binary numbers in computer 
systems. 



Set logic vs. Boolean logic 

Sets can contain any elements. We will first start out by discussing general set logic, then restrict ourselves to 
Boolean logic, where elements (or "bits") each contain only two possible values, called various names, such as "true" 
and "false", "yes" and "no", "on" and "off", or "1" and "0". 



Terms 



Let X be a set: 

• An element is one member of a set and is 
denoted by £ . If the element is not a 
member of a set it is denoted by ^ . 

• The universe is the set X, sometimes 
denoted by 1 . Note that this use of the 
word universe means "all elements being 
considered", which are not necessarily 
the same as "all elements there are ". 

• The empty set or null set is the set of no 

elements, denoted by 0 and sometimes 
0. 

• A unary operator applies to a single set. 
There is only one unary operator, called 
logical NOT. It works by taking the 
complement with respect to the universe, 

i.e. the set of all elements under consideration. 

• A binary operator applies to two sets. The basic binary operators are logical OR and logical AND. They 
perform the union and intersection of sets. There are also other derived binary operators, such as XOR (exclusive 
OR, i.e., "one or the other, but not both"). 

• A subset is denoted by A C B and means every element in set A is also in set B. 




Venn diagram showing the intersection of sets "A AND B" (in violet/dark 
shading), the union of sets "A OR B" (all the colored regions), and the exclusive 
OR case "set A XOR B" (all the colored regions except the violet). The "universe" 
is represented by all the area within the rectangular frame. 
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• A superset is denoted by A ~D B and means every element in set B is also in set A. 

• The identity or equivalence of two sets is denoted by A = B an d means that every element in set A is also in 
set B and every element in set B is also in set A. 

• A proper subset is denoted by A C B and means every element in set A is also in set B and the two sets are 
not identical. 

• A proper superset is denoted by A D B and means every element in set B is also in set A and the two sets are 
not identical. 

Example 

Imagine that set A contains all even numbers (multiples of two) in "the universe" (defined in the example below as 
all integers between 0 and 30 inclusive) and set B contains all multiples of three in "the universe". Then the 
intersection of the two sets (all elements in sets A AND B) would be all multiples of six in "the universe". The 
complement of set A (all elements NOT in set A) would be all odd numbers in "the universe". 



Chaining operations together 

While at most two sets are joined in 
any Boolean operation, the new set 
formed by that operation can then be 
joined with other sets utilizing 
additional Boolean operations. Using 
the previous example, we can define a 
new set C as the set of all multiples of 
five in "the universe". Thus "sets A 
AND B AND C" would be all 
multiples of 30 in "the universe". If 
more convenient, we may consider set 
AB to be the intersection of sets A and 
B, or the set of all multiples of six in 
"the universe". Then we can say "sets AB AND C" are the set of all multiples of 30 in "the universe". We could then 
take it a step further, and call this result set ABC. 

Use of parentheses 

While any number of logical ANDs (or any number of logical ORs) may be chained together without ambiguity, the 
combination of ANDs and ORs and NOTs can lead to ambiguous cases. In such cases, parentheses may be used to 
clarify the order of operations. As always, the operations within the innermost pair is performed first, followed by 
the next pair out, etc., until all operations within parentheses have been completed. Then any operations outside the 
parentheses are performed. 

Application to binary values 

In this example we have used natural numbers, while in Boolean logic binary numbers are used. The universe, for 
example, could contain just two elements, "0" and "1" (or "true" and "false", "yes" and "no", "on" or "off", etc.). We 
could also combine binary values together to get binary words, such as, in the case of two digits, "00", "01", "10", 
and "11". Applying set logic to those values, we could have a set of all values where the first digit is "0" ("00" and 
"01") and the set of all values where the first and second digits are different ("01" and "10"). The intersection of the 
two sets would then be the single element, "01". This could be shown by the following Boolean expression, where 



UNIVERSE (Integers from 0 to 30) 
1 7 11 13 17 19 23 29 
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"1st" is the first digit and "2nd" is the second digit: 
(NOT 1st) AND (1st XOR 2nd) 

Properties 

We define symbols for the two primary binary operations as A/fl (logical AND/set intersection) and V/U 
(logical OR/set union), and for the single unary operation — ■ / ~ (logical NOT/set complement). We will also use the 
values 0 (logical FALSE/the empty set) and 1 (logical TRUE/the universe). The following properties apply to both 
Boolean logic and set logic (although only the notation for Boolean logic is displayed here): 

a V (6 V c) = (a V 6) V c a A (b A c) = (a A b) A c associativity 

a V b = bV a a Ab = b A a commutativity 

a V (a A b) = a a A (a V b) = a absorption 
a V (6 A c) = (a V b) A (a V c) a A (b V c) = (a A b) V (a A c) distributivity 

a V — >a =1 a A — >a = 0 complements 

a V a = a a A a = a idempotency 

aV0 = a aAl = a boundedness 

aVl = 1 a A 0 = 0 

— 10 = 1 — il = 0 0 and 1 are complements 

-i (a V b) = -<a A --6 ->(a A 6) = -*z V ->6 de Morgan's laws 

- 1— >a = a involution 

The first three properties define a lattice; the first five define a Boolean algebra. The remaining five are a 
consequence of the first five. 



Other notations 

Mathematicians and engineers often use plus (+) for OR and a product sign ( • ) for AND. OR and AND are 
somewhat analogous to addition and multiplication in other algebraic structures, and this notation makes it very easy 
to get sum of products form for normal algebra. NOT may be represented by a line drawn above the expression being 
negated ( x )• It also commonly leads to giving • a higher precedence than +, removing the need for parenthesis in 
some cases. 

Programmers will often use a pipe symbol (I) for OR, an ampersand (&) for AND, and a tilde (~) for NOT. In many 
programming languages, these symbols stand for bitwise operations. "II", "&&", and "!" are used for variants of these 
operations. 

Another notation uses "meet" for AND and "join" for OR. However, this can lead to confusion, as the term "join" is 
also commonly used for any Boolean operation which combines sets together, which includes both AND and OR. 
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Basic mathematics use of Boolean terms 

• In the case of simultaneous equations, they are connected with an implied logical AND: 

x + y = 2 
AND 
x - y = 2 

• The same applies to simultaneous inequalities: 

x + y < 2 
AND 

x-y <2 

• The greater than or equals sign ( > ) and less than or equals sign ( < ) may be assumed to contain a logical 
OR: 

X<2 
OR 
X = 2 

• The plus/minus sign ( ± ), as in the case of the solution to a square root problem, may be taken as logical OR: 

WIDTH = 3 
OR 

WIDTH = -3 

English language use of Boolean terms 

Care should be taken when converting an English sentence into a formal boolean statement. Many English sentences 
have imprecise meanings. 

In certain cases, AND and OR can be used interchangeably in English: 

• / always carry an umbrella for when it rains and snows. 

• / always carry an umbrella for when it rains or snows. 

• / never walk in the rain or snow. 

Sometimes the English words "and" and "or" have a meaning that is apparently opposite of its meaning in boolean 
logic: 

• "Give me all the red and blue berries," usually means, "Give me all berries that are red or blue". (The former 
might have been interpreted as a request for berries that are each both red and blue.) An alternative phrasing for 
this request would be, "Give me all berries that are red and all berries that are blue." 

Depending on the context, the word "or" may correspond with either logical OR or logical XOR: 

• The waitress asked, "Would you like cream or sugar with your coffee?" (Logical OR.) 

• The waitress asked, "Would you like soup or salad with your meal? " (Logical XOR.) 

Logical XOR can be translated as "one, or the other, but not both". In most cases, this concept is most effectively 
communicated in English using "either/or". 

The word combination "and/or" is sometimes used in English to specify a logical OR, when just using the word "or" 
alone might have been mistaken as meaning logical XOR: 

• "I'm having chicken and/or beef for dinner." (Logical OR.) An alternative phrasing for standard written English 
would be, "For dinner, I'm having chicken or beef (or both)." 
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• The use of "and/or" is generally disfavored in formal writing. Its usage may introduce critical imprecision in 
legal agreements, research findings, and specifications for computer programs or electronic circuits. 

This can be a significant challenge when providing precise specifications for a computer program or electronic 
circuit in English. The description of such functionality may be ambiguous. Take for example the statement, "The 
program should verify that the applicant has checked the male or female box." This should be interpreted as an XOR 
and a verification performed to ensure that one, and only one, box is selected. In other cases the proper interpretation 
of English may be less obvious; the author of the specification should be consulted to determine the original intent. 

Applications 

Digital electronic circuit design 

Boolean logic is also used for circuit design in electrical engineering; here 0 and 1 may represent the two different 
states of one bit in a digital circuit, typically high and low voltage. Circuits are described by expressions containing 
variables, and two such expressions are equal for all values of the variables if, and only if, the corresponding circuits 
have the same input-output behavior. Furthermore, every possible input-output behavior can be modeled by a 
suitable Boolean expression. 

Basic logic gates such as AND, OR, and NOT gates may be used alone, or in conjunction with NAND, NOR, and 
XOR gates, to control digital electronics and circuitry. Whether these gates are wired in series or parallel controls the 
precedence of the operations. 

Database applications 

Relational databases use SQL, or other database-specific languages, to perform queries, which may contain Boolean 
logic. For this application, each record in a table may be considered to be an "element" of a "set". For example, in 
SQL, these SELECT statements are used to retrieve data from tables in the database: 



SELECT 

' James ' 


* FROM 

} 


employees 


WHERE 


last_name 


= 'Dean' AND 


first_name = 


SELECT 

' James ' 


* FROM 

r 


employees 


WHERE 


last_name 


= 'Dean' OR 


first_name 




SELECT 


* FROM 


employees 


WHERE 


NOT last_ 


name = 'Dean' 





Parentheses may be used to explicitly specify the order in which Boolean operations occur, when multiple operations 
are present: 

SELECT * FROM employees WHERE (NOT last_name = 'Smith') AND (first_name 
= 'John' OR first_name = 'Mary') ; 



Multiple sets of nested parentheses may also be used, where needed. 

Any Boolean operation (or operations) which combines two (or more) tables together is referred to as a join, in 
relational database terminology. 

In the field of Electronic Medical Records, some software applications use Boolean logic to query their patient 
databases, in what has been named Concept Processing technology. 
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Search engine queries 

Search engine queries also employ Boolean logic. For this application, each web page on the Internet may be 
considered to be an "element" of a "set". The following examples use a syntax supported by Google. 1 

• Doublequotes are used to combine whitespace-separated words into a single search term. 1 

• Whitespace is used to specify logical AND, as it is the default operator for joining search terms: 

"Search term 1" "Search term 2" 

• The OR keyword is used for logical OR: 

"Search term 1" OR "Search term 2" 

• The minus sign is used for logical NOT (AND NOT): 

"Search term 1" -"Search term 2" 

Notes and references 

[1] Usage Guide (http://www.ntsc.navy.mi1/Resources/Library/Acqguide/SpecWord.htm#and_or). 

[2] Not all search engines support the same query syntax. Additionally, some organizations provide "specialized" search engines that support 
alternate or extended syntax. (See e.g., Syntax Cheatsheet (http://www.google.com/help/cheatsheet.html), Google codesearch supports 
regular expressions (http://www.google.eom/intl/en/help/faq_codesearch.html#regexp)). 

[3] Doublequote-delimited search terms are called "exact phrase" searches in the Google documentation. 

External links 

• The Calculus of Logic (http://www.maths.tcd .ie/pub/HistMath/People/Boole/CalcLogic/CalcLogic.html), 
by George Boole, Cambridge and Dublin Mathematical Journal Vol. Ill (1848), pp. 183-98. 

• Logical Formula Evaluator (http://sourceforge.net/projects/logicaleval/) (for Windows), a software which 
calculates all possible values of a logical formula 

• Maiki & Boaz BDD-PROJECT (http://www.bdd-project.com), a Web Application for BDD reduction and 
visualization. 
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Algebraic logic 

In mathematical logic, algebraic logic is the study of logic presented in an algebraic style. 

Algebras as models of logics 

Algebraic logic treats algebraic structures, often bounded lattices, as models (interpretations) of certain logics, 
making logic a branch of order theory. 

In algebraic logic: 

• Variables are tacitly universally quantified over some universe of discourse. There are no existentially quantified 
variables or open formulas; 

• Terms are built up from variables using primitive and defined operations. There are no connectives; 

• Formulas, built from terms in the usual way, can be equated if they are logically equivalent. To express a 
tautology, equate a formula with a truth value; 

• The rules of proof are the substitution of equals for equals, and uniform replacement. Modus ponens remains 
valid, but is seldom employed. 

In the table below, the left column contains one or more logical or mathematical systems, and the algebraic structure 
which are its models are shown on the right in the same row. Some of these structures are either Boolean algebras or 
proper extensions thereof. Modal and other nonclassical logics are typically modeled by what are called "Boolean 
algebras with operators." 

Algebraic formalisms going beyond first-order logic in at least some respects include: 

• Combinatory logic, having the expressive power of set theory; 

• Relation algebra, arguably the paradigmatic algebraic logic, can express Peano arithmetic and most axiomatic set 
theories, including the canonical ZFC. 



logical system 


its models 


Classical sentential logic 


Lindenbaum-Tarski algebra Two-element Boolean algebra 


Intuitionistic propositional logic 


Heyting algebra 


Lukasiewicz logic 


MV-algebra 


Modal logic K 


Modal algebra 


Lewis's S4 


Interior algebra 


Lewis's S5; Monadic predicate logic 


Monadic Boolean algebra 


First-order logic 


Cylindric algebra Polyadic algebra 
Predicate functor logic 


Set theory 


Combinatory logic Relation algebra 
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History 

On the history of algebraic logic before World War II, see Brady (2000) and Grattan-Guinness (2000) and their 
ample references. On the postwar history, see Maddux (1991) and Quine (1976). 

Algebraic logic has at least two meanings: 

• The study of Boolean algebra, begun by George Boole, and of relation algebra, begun by Augustus DeMorgan, 
extended by Charles Sanders Peirce, and taking definitive form in the work of Ernst Schroder; 

• Abstract algebraic logic, a branch of contemporary mathematical logic. 

Perhaps surprisingly, algebraic logic is the oldest approach to formal logic, arguably beginning with a number of 
memoranda Leibniz wrote in the 1680s, some of which were published in the 19th century and translated into 
English by Clarence Lewis in 1918. But nearly all of Leibniz's known work on algebraic logic was published only in 
1903, after Louis Couturat discovered it in Leibniz's Nachlass. Parkinson (1966) and Loemker (1969) translated 
selections from Couturat' s volume into English. 

Brady (2000) discusses the rich historical connections between algebraic logic and model theory. The founders of 
model theory, Ernst Schroder and Leopold Loewenheim, were logicians in the algebraic tradition. Alfred Tarski, the 
founder of set theoretic model theory as a major branch of contemporary mathematical logic, also: 

• Co-discovered Lindenbaum-Tarski algebra; 

• Invented cylindric algebra; 

• Wrote the 1941 paper that revived relation algebra, and that can be seen as the starting point of abstract algebraic 
logic. 

Modern mathematical logic began in 1847, with two pamphlets whose respective authors were Augustus DeMorgan 
and George Boole. They, and later C.S. Peirce, Hugh MacColl, Frege, Peano, Bertrand Russell, and A. N. Whitehead 
all shared Leibniz's dream of combining symbolic logic, mathematics, and philosophy. Relation algebra is arguably 
the culmination of Leibniz's approach to logic. With the exception of some writings by Leopold Loewenheim and 
Thoralf Skolem, algebraic logic went into eclipse soon after the 1910-13 publication of Principia Mathematica, not 
to revive until Tarski's 1940 reexposition of relation algebra. 

Leibniz had no influence on the rise of algebraic logic because his logical writings were little studied before the 
Parkinson and Loemker translations. Our present understanding of Leibniz the logician stems mainly from the work 
of Wolfgang Lenzen, summarized in Lenzen (2004). ^ To see how present-day work in logic and metaphysics can 
draw inspiration from, and shed light on, Leibniz's thought, see Zalta (2000). 
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External links 

• Stanford Encyclopedia of Philosophy: "Propositional Consequence Relations and Algebraic Logic ^ M — by 
Ramon Jansana. 
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Lukasiewicz logic 

In mathematics, Lukasiewicz logic (English pronunciation: /lu:k9'}£vit}/, Polish pronunciation: [wuka'G£v J itg]) is a 
non-classical, many valued logic. It was originally defined in the early 20th-century by Jan Lukasiewicz as a 
three- valued logic; ^ it was later generalized to ft- valued (for all finite n) as well as infinitely-many- valued variants, 
both propositional and first-order It belongs to the classes of t-norm fuzzy logics^ and substructural logics ^ 

Language 

The propositional connectives of Lukasiewicz logic are implication — > , negation — ■ , equivalence w , weak 
conjunction A , strong conjunction (g) , weak disjunction V , strong disjunction © , and propositional constants 
(jand J. The presence of weak and strong conjunction and disjunction is a common feature of substructural logics 
without the rule of contraction, among which Lukasiewicz logic belongs. 

Axioms 

The original system of axioms for propositional infinite- valued Lukasiewicz logic used implication and negation as 
the primitive connectives: 

A _> (B -> A) 

(A^B)^((B-^C)->(A^C)) 
((A^B)^B)->((B^A)^A) 
(-•S — > ->A) -> (A^B). 

Propositional infinite-valued Lukasiewicz logic can also be axiomatized by adding the following axioms to the 
axiomatic system of monoidal t-norm logic: 

• Divisibility: (A A B) -> (A <g> (A -> B)) 

• Double negation: —\—\A — > A. 

That is, infinite- valued Lukasiewicz logic arises by adding the axiom of double negation to basic t-norm logic BL, or 
by adding the axiom of divisibility to the logic IMTL. 
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Real-valued semantics 

Infinite- valued Lukasiewicz logic is a real- valued logic in which sentences from sentential calculus may be assigned 
a truth value of not only zero or one but also any real number in between (eg. 0.25). Valuations have a recursive 
definition where: 

• w{6 o0) = F Q (w(6) : w((j)))for a binary connective °, 

• w{^0) = F^(w{0)), 

• w(0) =0and™(l) = 1, 

and where the definitions of the operations hold as follows: 

• Implication: y) = min{l, 1 — x + y} 

• Equivalence: F++(z : y) = 1 — \x — y\ 

• Negation: F^{x) = 1 — x 

• Weak Conjunction: F A (x, y) = min{x, y} 

• Weak Disjunction: F v (x : y) = max{x, y} 

• Strong Conjunction: F®(x : y) — maxjO, x + y — 1} 

• Strong Disjunction: F®(x, y) — min{l, x + y}. 

The truth function F® of strong conjunction is the Lukasiewicz t-norm and the truth function Fq of strong 
disjunction is its dual t-conorm. The truth function i^is the residuum of the Lukasiewicz t-norm. All truth 
functions of the basic connectives are continuous. 

By definition, a formula is a tautology of infinite- valued Lukasiewicz logic if it evaluates to 1 under any valuation of 
propositional variables by real numbers in the interval [0, 1]. 

General algebraic semantics 

The standard real-valued semantics determined by the Lukasiewicz t-norm is not the only possible semantics of 
Lukasiewicz logic. General algebraic semantics of propositional infinite-valued Lukasiewicz logic is formed by the 
class of all MV-algebras. The standard real-valued semantics is a special MV-algebra, called the standard 
MV -algebra. 

Like other t-norm fuzzy logics, propositional infinite- valued Lukasiewicz logic enjoys completeness with respect to 
the class of all algebras for which the logic is sound (that is, MV-algebras) as well as with respect to only linear 
ones. This is expressed by the general, linear, and standard completeness theorems: 

The following conditions are equivalent: 

• A is provable in propositional infinite- valued Lukasiewicz logic 

• A is valid in all MV-algebras (general completeness) 

• A is valid in all linearly ordered MV-algebras (linear completeness) 

• A is valid in the standard MV-algebra (standard completeness). 
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Intuitionistic logic 

Intuitionistic logic, or constructive logic, is a symbolic logic system that differs from classical logic in its definition 
of what it means for a statement to be true. In classical logic, all well-formed statements are assumed to be either true 
or false, even if we do not have a proof of either. In constructive logic, a statement is only true if there is a proof that 
it is true, and only false if there is a proof that it is false. Operations in constructive logic preserve justification, rather 
than truth. Syntactically, intuitionist logic differs from classical logic in that the law of excluded middle and double 
negation elimination are not axioms of the system, and cannot be proved in it. 

Constructive logic is practically useful because its restrictions produce proofs that have the existence property, 
making it also suitable for other forms of mathematical constructivism. Informally, this means that if you have a 
constructive proof that an object exists, you can turn that constructive proof into an algorithm for generating an 
example of it. 

It was originally developed by Arend Heyting to provide a formal basis for Brouwer's programme of intuitionism. 

Syntax 

The syntax of formulas of intuitionistic logic is 
similar to propositional logic or first-order logic. 
However, intuitionistic connectives are not 
definable in terms of each other in the same way 
as in classical logic, hence their choice matters. 
In intuitionistic propositional logic it is 
customary to use a, v, ± as the basic 
connectives, treating -iA as an abbreviation for 
(A — » ±). In intuitionistic first-order logic both 
quantifiers 3, V are needed. 

Many tautologies of classical logic can no longer 
be proven within intuitionistic logic. Examples 
include not only the law of excluded middle p v 
-1/7, but also Peirce's law ((/? — » q) —> p) —> p, 
and even double negation elimination. In 
classical logic, both p —> — ■/? and also -i-p —> p 
are theorems. In intuitionistic logic, only the 
former is a theorem: double negation can be 
introduced, but it cannot be eliminated. Rejecting p v -*p may seem strange to those more familiar with classical 
logic, but proving this statement in constructive logic would require producing a proof for the truth or falsity of all 
possible statements, which is impossible for a variety of reasons. 

Because many classically valid tautologies are not theorems of intuitionistic logic, but all theorems of intuitionist 
logic are valid classically, intuitionist logic can be viewed as a weakening of classical logic, albeit one with many 
useful properties. 



(((-n-.p > p) - (p v ->p)) -* (-pv p» 



((-•-?-» p)-(pv-p)) 
-(-.pv-.-.rt 



- (-.-.pv p - p)) 




The Rieger-Nishimura lattice. Its nodes are the propositional formulas in one 
variable up to intuitionistic logical equivalence, ordered by intuitionistic 
logical implication. 
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Sequent calculus 

Gentzen discovered that a simple restriction of his system LK (his sequent calculus for classical logic) results in a 
system which is sound and complete with respect to intuitionistic logic. He called this system LJ. In LK any number 
of formulas is allowed to appear on the conclusion side of a sequent; in contrast LJ allows at most one formula in 
this position. 

Other derivatives of LK are limited to intuitionstic derivations but still allow multiple conclusions in a sequent. LJ' 
^ is one example. 

Hilbert-style calculus 

Intuitionistic logic can be defined using the following Hilbert-style calculus. Compare with the deduction system at 
Propositional calculus# Alternative calculus. 

In propositional logic, the inference rule is modus ponens 

• MP: from 0 and 0 — > 0 infer 0 
and the axioms are 

• THEN-1: 0 -> ( X -> 0) 

• THEN-2: (0 -> ( X -> $)) -> ((0 _> x ) -> (0 _> ^)) 

• AND-1: 0 A x — > 0 

• AND-2: 0 A x — > X 

• AND-3:0^( X ^(0A X )) 

• OR-1: 0 — > 0 V x 

• OR-2: x -> 0 V x 

• (0^^)^((X^^)^(0VX^^)) 

• FALSE: _L — > 0 

To make this a system of first-order predicate logic, the generalization rules 

• V -GEN: from ip — > 0 infer ip — > (Vx 0) , if x is not free in %j) 

• 3 -GEN: from 0—^0 infer (3x 0) — > if; , if x is not free in -0 
are added, along with the axioms 

• PRED-1: (Vx (/){x)) — > 0(i) , if the term t is free for substitution for the variable x in 0 (i.e., if no occurrence 

of any variable in t becomes bound in (j)(t) ) 

• PRED-2: (j)(t) — > (3x 0(x)) , with the same restriction as for PRED-1 

Optional connectives 
Negation 

If one wishes to include a connective — i for negation rather than consider it an abbreviation for 0 — > _L , it is 
enough to add: 

• NOT-1': (0 -4 _L) -> -10 

• NOT-2': -10 _> (0 _> _L) 

There are a number of alternatives available if one wishes to omit the connective _L (false). For example, one may 
replace the three axioms FALSE, NOT-1', and NOT-2' with the two axioms 

. NOT-1: (0 -> x ) -> ({0 -> -> 

• NOT-2: 0 (-.0 -> 

as at Propositional calculus#Axioms. Alternatives to NOT-1 are (0 — > — i^) — ► — ► — >0)or 
(0 _^ -.0) _^ ^0 . 



Intuitionistic logic 



484 



Equivalence 

The connective <-> for equivalence may be treated as an abbreviation, with 0 X standing for 
(0 ~ > X) A (x ~~ * 0) • Alternatively, one may add the axioms 
. IFF-1:(0<-> X )->(0_> X ) 

• IFF-2: (0 <-> x ) _> ( x _> 0) 

. IFF-3: (0 X ) -> (( X -> 0) -> (0 <-> X )) 

IFF-1 and IFF-2 can, if desired, be combined into a single axiom (0 x) ~ ^ ((0 ~~ ^ x) A (x ~* 0)) usm S 
conjunction. 

Relation to classical logic 

The system of classical logic is obtained by adding any one of the following axioms: 

• 0 V — >0 (Law of the excluded middle. May also be formulated as (0 — ■» ^) — * ((-i0 — > x) — > x) •) 

• — i— 10 — > 0 (Double negation elimination) 

• ((0 — > x) — * 0) — > 0 (Peirce's law) 

In general, one may take as the extra axiom any classical tautology that is not valid in the two-element Kripke frame 
o >o (in other words, that is not included in Smetanich's logic). 

Another relationship is given by the Godel-Gentzen negative translation, which provides an embedding of classical 
first-order logic into intuitionistic logic: a first-order formula is provable in classical logic if and only if its 
Godel-Gentzen translation is provable intuitionistically. Therefore intuitionistic logic can instead be seen as a means 
of extending classical logic with constructive semantics. 

Non-interdefinability of operators 

In classical propositional logic, it is possible to take one of conjunction, disjunction, or implication as primitive, and 
define the other two in terms of it together with negation, such as in Lukasiewicz's three axioms of propositional 
logic. It is even possible to define all four in terms of a sole sufficient operator such as the Peirce arrow (NOR) or 
Sheffer stroke (NAND). Similarly, in classical first-order logic, one of the quantifiers can be defined in terms of the 
other and negation. 

These are fundamentally consequences of the law of bivalence, which makes all such connectives merely Boolean 
functions. The law of bivalence does not hold in intuitionistic logic, only the law of non-contradiction. As a result 
none of the basic connectives can be dispensed with, and the above axioms are all necessary. Most of the classical 
identities are only theorems of intuitionistic logic in one direction, although some are theorems in both directions. 
They are as follows: 

Conjunction versus disjunction: 

• (0 A if}) -> -.(-.0 V -nV) 

• (0 v ip) -> -.(-.0 a 

• (-.0 V -<0) -> ^(0 A VO 

• (-10 A -i^) <r-> -i(0 V if)) 
Conjunction versus implication: 

• (0 A if)) — > -<(0 — > -i^) 

• (0 — > VO ~ * ->(0 A -i^) 

• (0 A -1-0) — > -i(0 — > if)) 

• (0 — > ->0) <-> ->(0 A if)) 
Disjunction versus implication: 

• (0 V y)^H>^V) 
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• (-»0V^)->(0->V) 

• — > V) _, ( — '0 V '0) 

• V <-> — i( — 10 — > -0) 

Universal versus existential quantification: 

• (Vx 0(x)) — > -i(3x -^(x)) 

• (3x 0(x)) -> -(Vx -0(x)) 

• (3x -0(x)) -> -.(Vx 0(x)) 

• (Vx _, 0(x)) <-» -<(3x 0(x)) 

So, for example, "a or b M is a stronger statement than "if not a, then b", whereas these are classically interchangeable. 
On the other hand, "neither a nor b" is equivalent to "not a, and also not b". 

If we include equivalence in the list of connectives, some of the connectives become definable from others: 

• (0 -> if)) <-> ((0 V ^) <r+ if}) 

• (0 — > if)) ((0 A <-> 0) 

• (0A^)^((0^^)^0) 

• (0 A -0) <-> (((0 V 0) <-> 0) <-> 0) 

In particular, { v, <-», ±} and { v, <-», ->} are complete bases of intuitionistic connectives. 

As shown by Alexander Kuznetsov, either of the following defined connectives can serve the role of a sole sufficient 
operator for intuitionistic logic: 

• ((p V q) A -it) V A (5 <-> r)) 5 

• (9 A nr A (s V i)). 

Semantics 

The semantics are rather more complicated than for the classical case. A model theory can be given by Heyting 
algebras or, equivalently, by Kripke semantics. 

Heyting algebra semantics 

In classical logic, we often discuss the truth values that a formula can take. The values are usually chosen as the 
members of a Boolean algebra. The meet and join operations in the Boolean algebra are identified with the a and v 
logical connectives, so that the value of a formula of the form A a B is the meet of the value of A and the value of B 
in the Boolean algebra. Then we have the useful theorem that a formula is a valid sentence of classical logic if and 
only if its value is 1 for every valuation — that is, for any assignment of values to its variables. 

A corresponding theorem is true for intuitionistic logic, but instead of assigning each formula a value from a Boolean 
algebra, one uses values from a Heyting algebra, of which Boolean algebras are a special case. A formula is valid in 
intuitionistic logic if and only if it receives the value of the top element for any valuation on any Heyting algebra. 

It can be shown that to recognize valid formulas, it is sufficient to consider a single Heyting algebra whose elements 
are the open subsets of the real line R. In this algebra, the a and v operations correspond to set intersection and 
union, and the value assigned to a formula A — » B is int(A u B), the interior of the union of the value of B and the 
complement of the value of A. The bottom element is the empty set 0, and the top element is the entire line R. The 
negation -*A of a formula A is (as usual) defined to be A — » 0. The value of -*A then reduces to int(A ), the interior 
of the complement of the value of A, also known as the exterior of A. With these assignments, intuitionistically valid 
formulas are precisely those that are assigned the value of the entire line. 1 

For example, the formula ->(A a -iA) is valid, because no matter what set X is chosen as the value of the formula A, 
the value of ->(A a -iA) can be shown to be the entire line: 
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ValueHA a -iA)) = 
int((Value(A a ^A)) C ) = 
int((Value(A) n Value(^A)) C ) = 
int((X n int((Value(A)) C )) C ) = 
intCCXnintCX 0 )) 0 ) 

A theorem of topology tells us that intCX 0 ) is a subset of , so the intersection is empty, leaving: 

int(0 C ) = int(R) = R 
So the valuation of this formula is true, and indeed the formula is valid. 

But the law of the excluded middle, A v -A, can be shown to be invalid by letting the value of A be {y : y > 0 }. 
Then the value of -iA is the interior of {y : y < 0 }, which is {y : y < 0 }, and the value of the formula is the union of 
{y : y > 0 } and {y : y < 0 }, which is : v * 0 }, not the entire line. 

The interpretation of any intuitionistically valid formula in the infinite Heyting algebra described above results in the 
top element, representing true, as the valuation of the formula, regardless of what values from the algebra are 
assigned to the variables of the formula. Conversely, for every invalid formula, there is an assignment of values to 
the variables that yields a valuation that differs from the top element.^ ^ No finite Heyting algebra has both these 
properties.^ 

Kripke semantics 

Building upon his work on semantics of modal logic, Saul Kripke created another semantics for intuitionistic logic, 
known as Kripke semantics or relational semantics ^ . 

Relation to other logics 

Intutionistic logic is related by duality to a paraconsistent logic known as Brazilian, anti-intuitionistic or 
dual-intuitionistic logic . 

The subsystem of intuitionistic logic with the FALSE axiom removed is known as minimal logic. 
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Mathematical logic 

Mathematical logic (also known as symbolic logic) is a subfield of mathematics with close connections to computer 
science and philosophical logic. ^ The field includes both the mathematical study of logic and the applications of 
formal logic to other areas of mathematics. The unifying themes in mathematical logic include the study of the 
expressive power of formal systems and the deductive power of formal proof systems. 

Mathematical logic is often divided into the fields of set theory, model theory, recursion theory, and proof theory. 
These areas share basic results on logic, particularly first-order logic, and definability. In computer science 
(particularly in the ACM Classification) mathematical logic encompasses additional topics not detailed in this 
article; see logic in computer science for those. 

Since its inception, mathematical logic has contributed to, and has been motivated by, the study of foundations of 
mathematics. This study began in the late 19th century with the development of axiomatic frameworks for geometry, 
arithmetic, and analysis. In the early 20th century it was shaped by David Hilbert's program to prove the consistency 
of foundational theories. Results of Kurt Godel, Gerhard Gentzen, and others provided partial resolution to the 
program, and clarified the issues involved in proving consistency. Work in set theory showed that almost all ordinary 
mathematics can be formalized in terms of sets, although there are some theorems that cannot be proven in common 
axiom systems for set theory. Contemporary work in the foundations of mathematics often focuses on establishing 
which parts of mathematics can be formalized in particular formal systems, rather than trying to find theories in 
which all of mathematics can be developed. 

History 

Mathematical logic emerged in the mid- 19th century as a subfield of mathematics independent of the traditional 
study of logic (Ferreiros 2001, p. 443). Before this emergence, logic was studied with rhetoric, through the 
syllogism, and with philosophy. The first half of the 20th century saw an explosion of fundamental results, 
accompanied by vigorous debate over the foundations of mathematics. 

Early history 

Sophisticated theories of logic were developed in many cultures, including China, India, Greece and the Islamic 
world. In the 18th century, attempts to treat the operations of formal logic in a symbolic or algebraic way had been 
made by philosophical mathematicians including Leibniz and Lambert, but their labors remained isolated and little 
known. 
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19th century 

In the middle of the nineteenth century, George Boole and then Augustus De Morgan presented systematic 
mathematical treatments of logic. Their work, building on work by algebraists such as George Peacock, extended the 
traditional Aristotelian doctrine of logic into a sufficient framework for the study of foundations of 
mathematics (Katz 1998, p. 686). 

Charles Sanders Peirce built upon the work of Boole to develop a logical system for relations and quantifiers, which 
he published in several papers from 1870 to 1885. Gottlob Frege presented an independent development of logic 
with quantifiers in his Be 'griffs schrift, published in 1879, a work generally considered as marking a turning point in 
the history of logic. Frege's work remained obscure, however, until Bertrand Russell began to promote it near the 
turn of the century. The two-dimensional notation Frege developed was never widely adopted and is unused in 
contemporary texts. 

From 1890 to 1905, Ernst Schroder published Vorlesungen iiber die Algebra der Logik in three volumes. This work 
summarized and extended the work of Boole, De Morgan, and Peirce, and was a comprehensive reference to 
symbolic logic as it was understood at the end of the 19th century. 

Foundational theories 

Some concerns that mathematics had not been built on a proper foundation led to the development of axiomatic 
systems for fundamental areas of mathematics such as arithmetic, analysis, and geometry. 

In logic, the term arithmetic refers to the theory of the natural numbers. Giuseppe Peano (1888) published a set of 
axioms for arithmetic that came to bear his name (Peano axioms), using a variation of the logical system of Boole 
and Schroder but adding quantifiers. Peano was unaware of Frege's work at the time. Around the same time Richard 
Dedekind showed that the natural numbers are uniquely characterized by their induction properties. Dedekind (1888) 
proposed a different characterization, which lacked the formal logical character of Peano's axioms. Dedekind's work, 
however, proved theorems inaccessible in Peano's system, including the uniqueness of the set of natural numbers (up 
to isomorphism) and the recursive definitions of addition and multiplication from the successor function and 
mathematical induction. 

In the mid- 19th century, flaws in Euclid's axioms for geometry became known (Katz 1998, p. 774). In addition to the 
independence of the parallel postulate, established by Nikolai Lobachevsky in 1826 (Lobachevsky 1840), 
mathematicians discovered that certain theorems taken for granted by Euclid were not in fact provable from his 
axioms. Among these is the theorem that a line contains at least two points, or that circles of the same radius whose 
centers are separated by that radius must intersect. Hilbert (1899) developed a complete set of axioms for geometry, 
building on previous work by Pasch (1882). The success in axiomatizing geometry motivated Hilbert to seek 
complete axiomatizations of other areas of mathematics, such as the natural numbers and the real line. This would 
prove to be a major area of research in the first half of the 20th century. 

The 19th century saw great advances in the theory of real analysis, including theories of convergence of functions 
and Fourier series. Mathematicians such as Karl Weierstrass began to construct functions that stretched intuition, 
such as nowhere-differentiable continuous functions. Previous conceptions of a function as a rule for computation, or 
a smooth graph, were no longer adequate. Weierstrass began to advocate the arithmetization of analysis, which 
sought to axiomatize analysis using properties of the natural numbers. The modern (e, 6)-definition of limit and 
continuous functions was already developed by Bolzano in 1817 (Felscher 2000), but remained relatively unknown. 
Cauchy in 1821 defined continuity in terms of infinitesimals (see Cours d' Analyse, page 34). In 1858, Dedekind 
proposed a definition of the real numbers in terms of Dedekind cuts of rational numbers (Dedekind 1872), a 
definition still employed in contemporary texts. 

Georg Cantor developed the fundamental concepts of infinite set theory. His early results developed the theory of 
cardinality and proved that the reals and the natural numbers have different cardinalities (Cantor 1874). Over the 
next twenty years, Cantor developed a theory of transfinite numbers in a series of publications. In 1891, he published 
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a new proof of the uncountability of the real numbers that introduced the diagonal argument, and used this method to 
prove Cantor's theorem that no set can have the same cardinality as its powerset. Cantor believed that every set could 
be well-ordered, but was unable to produce a proof for this result, leaving it as an open problem in 1895 (Katz 1998, 
p. 807). 

20th century 

In the early decades of the 20th century, the main areas of study were set theory and formal logic. The discovery of 
paradoxes in informal set theory caused some to wonder whether mathematics itself is inconsistent, and to look for 
proofs of consistency. 

In 1900, Hilbert posed a famous list of 23 problems for the next century. The first two of these were to resolve the 
continuum hypothesis and prove the consistency of elementary arithmetic, respectively; the tenth was to produce a 
method that could decide whether a multivariate polynomial equation over the integers has a solution. Subsequent 
work to resolve these problems shaped the direction of mathematical logic, as did the effort to resolve Hilbert' s 
Entscheidungsproblem, posed in 1928. This problem asked for a procedure that would decide, given a formalized 
mathematical statement, whether the statement is true or false. 

Set theory and paradoxes 

Ernst Zermelo (1904) gave a proof that every set could be well-ordered, a result Georg Cantor had been unable to 
obtain. To achieve the proof, Zermelo introduced the axiom of choice, which drew heated debate and research 
among mathematicians and the pioneers of set theory. The immediate criticism of the method led Zermelo to publish 
a second exposition of his result, directly addressing criticisms of his proof (Zermelo 1908a). This paper led to the 
general acceptance of the axiom of choice in the mathematics community. 

Skepticism about the axiom of choice was reinforced by recently discovered paradoxes in naive set theory. Cesare 
Burali-Forti (1897) was the first to state a paradox: the Burali-Forti paradox shows that the collection of all ordinal 
numbers cannot form a set. Very soon thereafter, Bertrand Russell discovered Russell's paradox in 1901, and Jules 
Richard (1905) discovered Richard's paradox. 

Zermelo (1908b) provided the first set of axioms for set theory. These axioms, together with the additional axiom of 
replacement proposed by Abraham Fraenkel, are now called Zermelo-Fraenkel set theory (ZF). Zermelo's axioms 
incorporated the principle of limitation of size to avoid Russell's paradox. 

In 1910, the first volume of Principia Mathematica by Russell and Alfred North Whitehead was published. This 
seminal work developed the theory of functions and cardinality in a completely formal framework of type theory, 
which Russell and Whitehead developed in an effort to avoid the paradoxes. Principia Mathematica is considered 
one of the most influential works of the 20th century, although the framework of type theory did not prove popular 
as a foundational theory for mathematics (Ferreiros 2001, p. 445). 

Fraenkel (1922) proved that the axiom of choice cannot be proved from the remaining axioms of Zermelo's set 
theory with urelements. Later work by Paul Cohen (1966) showed that the addition of urelements is not needed, and 
the axiom of choice is unprovable in ZF. Cohen's proof developed the method of forcing, which is now an important 
tool for establishing independence results in set theory. 



Mathematical logic 



490 



Symbolic logic 

Leopold Lowenheim (1915) and Thoralf Skolem (1920) obtained the Lowenheim-Skolem theorem, which says that 
first-order logic cannot control the cardinalities of infinite structures. Skolem realized that this theorem would apply 
to first-order formalizations of set theory, and that it implies any such formalization has a countable model. This 
counterintuitive fact became known as Skolem's paradox. 

In his doctoral thesis, Kurt Godel (1929) proved the completeness theorem, which establishes a correspondence 
between syntax and semantics in first-order logic. Godel used the completeness theorem to prove the compactness 
theorem, demonstrating the finitary nature of first-order logical consequence. These results helped establish 
first-order logic as the dominant logic used by mathematicians. 

In 1931, Godel published On Formally Undecidable Propositions of Principia Mathematica and Related Systems, 
which proved the incompleteness (in a different meaning of the word) of all sufficiently strong, effective first-order 
theories. This result, known as Godel's incompleteness theorem, establishes severe limitations on axiomatic 
foundations for mathematics, striking a strong blow to Hilbert's program. It showed the impossibility of providing a 
consistency proof of arithmetic within any formal theory of arithmetic. Hilbert, however, did not acknowledge the 
importance of the incompleteness theorem for some time. 

Godel's theorem shows that a consistency proof of any sufficiently strong, effective axiom system cannot be obtained 
in the system itself, if the system is consistent, nor in any weaker system. This leaves open the possibility of 
consistency proofs that cannot be formalized within the system they consider. Gentzen (1936) proved the 
consistency of arithmetic using a finitistic system together with a principle of transfinite induction. Gentzen's result 
introduced the ideas of cut elimination and proof-theoretic ordinals, which became key tools in proof theory. Godel 
(1958) gave a different consistency proof, which reduces the consistency of classical arithmetic to that of 
intutitionistic arithmetic in higher types. 

Beginnings of the other branches 

Alfred Tarski developed the basics of model theory. 

Beginning in 1935, a group of prominent mathematicians collaborated under the pseudonym Nicolas Bourbaki to 
publish a series of encyclopedic mathematics texts. These texts, written in an austere and axiomatic style, 
emphasized rigorous presentation and set- theoretic foundations. Terminology coined by these texts, such as the 
words bijection, injection, and surjection, and the set-theoretic foundations the texts employed, were widely adopted 
throughout mathematics. 

The study of computability came to be known as recursion theory, because early formalizations by Godel and Kleene 
relied on recursive definitions of functions. When these definitions were shown equivalent to Turing's 
formalization involving Turing machines, it became clear that a new concept - the computable function — had been 
discovered, and that this definition was robust enough to admit numerous independent characterizations. In his work 
on the incompleteness theorems in 1931, Godel lacked a rigorous concept of an effective formal system; he 
immediately realized that the new definitions of computability could be used for this purpose, allowing him to state 
the incompleteness theorems in generality that could only be implied in the original paper. 

Numerous results in recursion theory were obtained in the 1940s by Stephen Cole Kleene and Emil Leon Post. 
Kleene (1943) introduced the concepts of relative computability, foreshadowed by Turing (1939), and the 
arithmetical hierarchy. Kleene later generalized recursion theory to higher-order functionals. Kleene and Kreisel 
studied formal versions of intuitionistic mathematics, particularly in the context of proof theory. 
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Subfields and scope 

The Handbook of Mathematical Logic makes a rough division of contemporary mathematical logic into four areas: 

1. set theory 

2. model theory 

3. recursion theory, and 

4. proof theory and constructive mathematics (considered as parts of a single area). 

Each area has a distinct focus, although many techniques and results are shared between multiple areas. The border 
lines between these fields, and the lines between mathematical logic and other fields of mathematics, are not always 
sharp. Godel's incompleteness theorem marks not only a milestone in recursion theory and proof theory, but has also 
led to Lob's theorem in modal logic. The method of forcing is employed in set theory, model theory, and recursion 
theory, as well as in the study of intuitionistic mathematics. 

The mathematical field of category theory uses many formal axiomatic methods, and includes the study of 
categorical logic, but category theory is not ordinarily considered a subfield of mathematical logic. Because of its 
applicability in diverse fields of mathematics, mathematicians including Saunders Mac Lane have proposed category 
theory as a foundational system for mathematics, independent of set theory. These foundations use toposes, which 
resemble generalized models of set theory that may employ classical or nonclassical logic. 

Formal logical systems 

At its core, mathematical logic deals with mathematical concepts expressed using formal logical systems. These 
systems, though they differ in many details, share the common property of considering only expressions in a fixed 
formal language, or signature. The system of first-order logic is the most widely studied today, because of its 
applicability to foundations of mathematics and because of its desirable proof-theoretic properties. 1 Stronger 
classical logics such as second-order logic or infinitary logic are also studied, along with nonclassical logics such as 
intuitionistic logic. 

First-order logic 

First-order logic is a particular formal system of logic. Its syntax involves only finite expressions as well-formed 
formulas, while its semantics are characterized by the limitation of all quantifiers to a fixed domain of discourse. 

Early results about formal logic established limitations of first-order logic. The Lowenheim-Skolem theorem (1919) 
showed that if a set of sentences in a countable first-order language has an infinite model then it has at least one 
model of each infinite cardinality. This shows that it is impossible for a set of first-order axioms to characterize the 
natural numbers, the real numbers, or any other infinite structure up to isomorphism. As the goal of early 
foundational studies was to produce axiomatic theories for all parts of mathematics, this limitation was particularly 
stark. 

Godel's completeness theorem (Godel 1929) established the equivalence between semantic and syntactic definitions 
of logical consequence in first-order logic. It shows that if a particular sentence is true in every model that satisfies a 
particular set of axioms, then there must be a finite deduction of the sentence from the axioms. The compactness 
theorem first appeared as a lemma in Godel's proof of the completeness theorem, and it took many years before 
logicians grasped its significance and began to apply it routinely. It says that a set of sentences has a model if and 
only if every finite subset has a model, or in other words that an inconsistent set of formulas must have a finite 
inconsistent subset. The completeness and compactness theorems allow for sophisticated analysis of logical 
consequence in first-order logic and the development of model theory, and they are a key reason for the prominence 
of first-order logic in mathematics. 

Godel's incompleteness theorems (Godel 1931) establish additional limits on first-order axiomatizations. The first 
incompleteness theorem states that for any sufficiently strong, effectively given logical system there exists a 
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statement which is true but not provable within that system. Here a logical system is effectively given if it is possible 
to decide, given any formula in the language of the system, whether the formula is an axiom. A logical system is 
sufficiently strong if it can express the Peano axioms. When applied to first-order logic, the first incompleteness 
theorem implies that any sufficiently strong, consistent, effective first-order theory has models that are not 
elementarily equivalent, a stronger limitation than the one established by the Lowenheim-Skolem theorem. The 
second incompleteness theorem states that no sufficiently strong, consistent, effective axiom system for arithmetic 
can prove its own consistency, which has been interpreted to show that Hilbert's program cannot be completed. 

Other classical logics 

Many logics besides first-order logic are studied. These include infinitary logics, which allow for formulas to 
provide an infinite amount of information, and higher-order logics, which include a portion of set theory directly in 
their semantics. 

The most well studied infinitary logic is L UliLJ . In this logic, quantifiers may only be nested to finite depths, as in 
first order logic, but formulas may have finite or countably infinite conjunctions and disjunctions within them. Thus, 
for example, it is possible to say that an object is a whole number using a formula of L UljLJ such as 

(x = 0) V (x = 1) V (x = 2) V - - - . 
Higher-order logics allow for quantification not only of elements of the domain of discourse, but subsets of the 
domain of discourse, sets of such subsets, and other objects of higher type. The semantics are defined so that, rather 
than having a separate domain for each higher-type quantifier to range over, the quantifiers instead range over all 
objects of the appropriate type. The logics studied before the development of first-order logic, for example Frege's 
logic, had similar set-theoretic aspects. Although higher-order logics are more expressive, allowing complete 
axiomatizations of structures such as the natural numbers, they do not satisfy analogues of the completeness and 
compactness theorems from first-order logic, and are thus less amenable to proof-theoretic analysis. 

Another type of logics are fixed-point logics that allow inductive definitions, like one writes for primitive recursive 
functions. 

One can formally define an extension of first-order logic — a notion which encompasses all logics in this section 
because they behave like first-order logic in certain fundamental ways, but does not encompass all logics in general, 
e.g. it does not encompass intuitionistic, modal or fuzzy logic. Lindstrom's theorem implies that the only extension of 
first-order logic satisfying both the Compactness theorem and the Downward Lowenheim-Skolem theorem is 
first-order logic. 

Nonclassical and modal logic 

Modal logics include additional modal operators, such as an operator which states that a particular formula is not 
only true, but necessarily true. Although modal logic is not often used to axiomatize mathematics, it has been used to 
study the properties of first-order provability (Solovay 1976) and set-theoretic forcing (Hamkins and Lowe 2007). 

Intuitionistic logic was developed by Heyting to study Brouwer's program of intuitionism, in which Brouwer himself 
avoided formalization. Intuitionistic logic specifically does not include the law of the excluded middle, which states 
that each sentence is either true or its negation is true. Kleene's work with the proof theory of intuitionistic logic 
showed that constructive information can be recovered from intuitionistic proofs. For example, any provably total 
function in intuitionistic arithmetic is computable; this is not true in classical theories of arithmetic such as Peano 
arithmetic. 
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Algebraic logic 

Algebraic logic uses the methods of abstract algebra to study the semantics of formal logics. A fundamental example 
is the use of Boolean algebras to represent truth values in classical propositional logic, and the use of Heyting 
algebras to represent truth values in intuitionistic propositional logic. Stronger logics, such as first-order logic and 
higher-order logic, are studied using more complicated algebraic structures such as cylindric algebras. 

Set theory 

Set theory is the study of sets, which are abstract collections of objects. Many of the basic notions, such as ordinal 
and cardinal numbers, were developed informally by Cantor before formal axiomatizations of set theory were 
developed. The first such axiomatization, due to Zermelo (1908b), was extended slightly to become 
Zermelo-Fraenkel set theory (ZF), which is now the most widely used foundational theory for mathematics. 

Other formalizations of set theory have been proposed, including von Neumann-Bernays-Godel set theory (NBG), 
Morse-Kelley set theory (MK), and New Foundations (NF). Of these, ZF, NBG, and MK are similar in describing a 
cumulative hierarchy of sets. New Foundations takes a different approach; it allows objects such as the set of all sets 
at the cost of restrictions on its set-existence axioms. The system of Kripke-Platek set theory is closely related to 
generalized recursion theory. 

Two famous statements in set theory are the axiom of choice and the continuum hypothesis. The axiom of choice, 
first stated by Zermelo (1904), was proved independent of ZF by Fraenkel (1922), but has come to be widely 
accepted by mathematicians. It states that given a collection of nonempty sets there is a single set C that contains 
exactly one element from each set in the collection. The set C is said to "choose" one element from each set in the 
collection. While the ability to make such a choice is considered obvious by some, since each set in the collection is 
nonempty, the lack of a general, concrete rule by which the choice can be made renders the axiom nonconstructive. 
Stefan Banach and Alfred Tarski (1924) showed that the axiom of choice can be used to decompose a solid ball into 
a finite number of pieces which can then be rearranged, with no scaling, to make two solid balls of the original size. 
This theorem, known as the Banach-Tarski paradox, is one of many counterintuitive results of the axiom of choice. 

The continuum hypothesis, first proposed as a conjecture by Cantor, was listed by David Hilbert as one of his 23 
problems in 1900. Godel showed that the continuum hypothesis cannot be disproven from the axioms of 
Zermelo-Fraenkel set theory (with or without the axiom of choice), by developing the constructible universe of set 
theory in which the continuum hypothesis must hold. In 1963, Paul Cohen showed that the continuum hypothesis 
cannot be proven from the axioms of Zermelo-Fraenkel set theory (Cohen 1966). This independence result did not 
completely settle Hilbert's question, however, as it is possible that new axioms for set theory could resolve the 
hypothesis. Recent work along these lines has been conducted by W. Hugh Woodin, although its importance is not 
yet clear (Woodin 2001). 

Contemporary research in set theory includes the study of large cardinals and determinacy. Large cardinals are 
cardinal numbers with particular properties so strong that the existence of such cardinals cannot be proved in ZFC. 
The existence of the smallest large cardinal typically studied, an inaccessible cardinal, already implies the 
consistency of ZFC. Despite the fact that large cardinals have extremely high cardinality, their existence has many 
ramifications for the structure of the real line. Determinacy refers to the possible existence of winning strategies for 
certain two-player games (the games are said to be determined). The existence of these strategies implies structural 
properties of the real line and other Polish spaces. 
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Model theory 

Model theory studies the models of various formal theories. Here a theory is a set of formulas in a particular formal 
logic and signature, while a model is a structure that gives a concrete interpretation of the theory. Model theory is 
closely related to universal algebra and algebraic geometry, although the methods of model theory focus more on 
logical considerations than those fields. 

The set of all models of a particular theory is called an elementary class; classical model theory seeks to determine 
the properties of models in a particular elementary class, or determine whether certain classes of structures form 
elementary classes. 

The method of quantifier elimination can be used to show that definable sets in particular theories cannot be too 
complicated. Tarski (1948) established quantifier elimination for real-closed fields, a result which also shows the 
theory of the field of real numbers is decidable. (He also noted that his methods were equally applicable to 
algebraically closed fields of arbitrary characteristic.) A modern subfield developing from this is concerned with 
o-minimal structures. 

Morley's categoricity theorem, proved by Michael D. Morley (1965), states that if a first-order theory in a countable 
language is categorical in some uncountable cardinality, i.e. all models of this cardinality are isomorphic, then it is 
categorical in all uncountable cardinalities. 

A trivial consequence of the continuum hypothesis is that a complete theory with less than continuum many 
nonisomorphic countable models can have only countably many. Vaught's conjecture, named after Robert Lawson 
Vaught, says that this is true even independently of the continuum hypothesis. Many special cases of this conjecture 
have been established. 

Recursion theory 

Recursion theory, also called computability theory, studies the properties of computable functions and the Turing 
degrees, which divide the uncomputable functions into sets which have the same level of uncomputability. Recursion 
theory also includes the study of generalized computability and definability. Recursion theory grew from of the work 
of Alonzo Church and Alan Turing in the 1930s, which was greatly extended by Kleene and Post in the 1940s. 

Classical recursion theory focuses on the computability of functions from the natural numbers to the natural 
numbers. The fundamental results establish a robust, canonical class of computable functions with numerous 
independent, equivalent characterizations using Turing machines, X calculus, and other systems. More advanced 
results concern the structure of the Turing degrees and the lattice of recursively enumerable sets. 

Generalized recursion theory extends the ideas of recursion theory to computations that are no longer necessarily 
finite. It includes the study of computability in higher types as well as areas such as hyperarithmetical theory and 
a-recursion theory. 

Contemporary research in recursion theory includes the study of applications such as algorithmic randomness, 
computable model theory, and reverse mathematics, as well as new results in pure recursion theory. 

Algorithmically unsolvable problems 

An important subfield of recursion theory studies algorithmic unsolvability; a decision problem or function problem 
is algorithmically unsolvable if there is no possible computable algorithm which returns the correct answer for all 
legal inputs to the problem. The first results about unsolvability, obtained independently by Church and Turing in 
1936, showed that the Entscheidungsproblem is algorithmically unsolvable. Turing proved this by establishing the 
unsolvability of the halting problem, a result with far-ranging implications in both recursion theory and computer 
science. 
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There are many known examples of undecidable problems from ordinary mathematics. The word problem for groups 
was proved algorithmically unsolvable by Pyotr Novikov in 1955 and independently by W. Boone in 1959. The busy 
beaver problem, developed by Tibor Rado in 1962, is another well-known example. 

Hilbert's tenth problem asked for an algorithm to determine whether a multivariate polynomial equation with integer 
coefficients has a solution in the integers. Partial progress was made by Julia Robinson, Martin Davis and Hilary 
Putnam. The algorithmic unsolvability of the problem was proved by Yuri Matiyasevich in 1970 (Davis 1973). 

Proof theory and constructive mathematics 

Proof theory is the study of formal proofs in various logical deduction systems. These proofs are represented as 
formal mathematical objects, facilitating their analysis by mathematical techniques. Several deduction systems are 
commonly considered, including Hilbert-style deduction systems, systems of natural deduction, and the sequent 
calculus developed by Gentzen. 

The study of constructive mathematics, in the context of mathematical logic, includes the study of systems in 
non-classical logic such as intuitionistic logic, as well as the study of predicative systems. An early proponent of 
predicativism was Hermann Weyl, who showed it is possible to develop a large part of real analysis using only 
predicative methods (Weyl 1918). 

Because proofs are entirely finitary, whereas truth in a structure is not, it is common for work in constructive 
mathematics to emphasize provability. The relationship between provability in classical (or nonconstructive) systems 
and provability in intuitionistic (or constructive, respectively) systems is of particular interest. Results such as the 
Godel-Gentzen negative translation show that it is possible to embed (or translate) classical logic into intuitionistic 
logic, allowing some properties about intuitionistic proofs to be transferred back to classical proofs. 

Recent developments in proof theory include the study of proof mining by Ulrich Kohlenbach and the study of 
proof-theoretic ordinals by Michael Rathjen. 

Connections with computer science 

The study of computability theory in computer science is closely related to the study of computability in 
mathematical logic. There is a difference of emphasis, however. Computer scientists often focus on concrete 
programming languages and feasible computability, while researchers in mathematical logic often focus on 
computability as a theoretical concept and on noncomputability. 

The theory of semantics of programming languages is related to model theory, as is program verification (in 
particular, model checking). The Curry-Howard isomorphism between proofs and programs relates to proof theory, 
especially intuitionistic logic. Formal calculi such as the lambda calculus and combinatory logic are now studied as 
idealized programming languages. 

Computer science also contributes to mathematics by developing techniques for the automatic checking or even 
finding of proofs, such as automated theorem proving and logic programming. 

Descriptive complexity theory relates logics to computational complexity. The first significant result in this area, 
Fagin's theorem (1974) established that NP is precisely the set of languages expressible by sentences of existential 
second-order logic. 
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Foundations of mathematics 

In the 19th century, mathematicians became aware of logical gaps and inconsistencies in their field. It was shown 
that Euclid's axioms for geometry, which had been taught for centuries as an example of the axiomatic method, were 
incomplete. The use of infinitesimals, and the very definition of function, came into question in analysis, as 
pathological examples such as Weierstrass' nowhere-differentiable continuous function were discovered. 

Cantor's study of arbitrary infinite sets also drew criticism. Leopold Kronecker famously stated "God made the 
integers; all else is the work of man," endorsing a return to the study of finite, concrete objects in mathematics. 
Although Kronecker' s argument was carried forward by construed vists in the 20th century, the mathematical 
community as a whole rejected them. David Hilbert argued in favor of the study of the infinite, saying "No one shall 
expel us from the Paradise that Cantor has created." 

Mathematicians began to search for axiom systems that could be used to formalize large parts of mathematics. In 
addition to removing ambiguity from previously-naive terms such as function, it was hoped that this axiomatization 
would allow for consistency proofs. In the 19th century, the main method of proving the consistency of a set of 
axioms was to provide a model for it. Thus, for example, non-Euclidean geometry can be proved consistent by 
defining point to mean a point on a fixed sphere and line to mean a great circle on the sphere. The resulting structure, 
a model of elliptic geometry, satisfies the axioms of plane geometry except the parallel postulate. 

With the development of formal logic, Hilbert asked whether it would be possible to prove that an axiom system is 
consistent by analyzing the structure of possible proofs in the system, and showing through this analysis that it is 
impossible to prove a contradiction. This idea led to the study of proof theory. Moreover, Hilbert proposed that the 
analysis should be entirely concrete, using the term finitary to refer to the methods he would allow but not precisely 
defining them. This project, known as Hilbert's program, was seriously affected by Godel's incompleteness theorems, 
which show that the consistency of formal theories of arithmetic cannot be established using methods formalizable in 
those theories. Gentzen showed that it is possible to produce a proof of the consistency of arithmetic in a finitary 
system augmented with axioms of transfinite induction, and the techniques he developed to so do were seminal in 
proof theory. 

A second thread in the history of foundations of mathematics involves nonclassical logics and constructive 
mathematics. The study of constructive mathematics includes many different programs with various definitions of 
constructive. At the most accommodating end, proofs in ZF set theory that do not use the axiom of choice are called 
constructive by many mathematicians. More limited versions of constructivism limit themselves to natural numbers, 
number-theoretic functions, and sets of natural numbers (which can be used to represent real numbers, facilitating 
the study of mathematical analysis). A common idea is that a concrete means of computing the values of the function 
must be known before the function itself can be said to exist. 

In the early 20th century, Luitzen Egbertus Jan Brouwer founded intuitionism as a philosophy of mathematics. This 
philosophy, poorly understood at first, stated that in order for a mathematical statement to be true to a 
mathematician, that person must be able to intuit the statement, to not only believe its truth but understand the reason 
for its truth. A consequence of this definition of truth was the rejection of the law of the excluded middle, for there 
are statements that, according to Brouwer, could not be claimed to be true while their negations also could not be 
claimed true. Brouwer's philosophy was influential, and the cause of bitter disputes among prominent 
mathematicians. Later, Kleene and Kreisel would study formalized versions of intuitionistic logic (Brouwer rejected 
formalization, and presented his work in unformalized natural language). With the advent of the BHK interpretation 
and Kripke models, intuitionism became easier to reconcile with classical mathematics. 
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See also 

• List of mathematical logic topics 

• List of computability and complexity topics 

• List of set theory topics 

• List of first-order theories 

• Knowledge representation 

• Metalogic 

• Logic symbols 

Notes 

[1] Undergraduate texts include Boolos, Burgess, and Jeffrey (2002), Enderton (2001), and Mendelson (1997). A classic graduate text by 

Shoenfield (2001) first appeared in 1967. 
[2] A detailed study of this terminology is given by Soare (1996). 

[3] Ferreiros (2001) surveys the rise of first-order logic over other formal logics in the early 20th century. 
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Heyting arithmetic 

In mathematical logic, Heyting arithmetic (sometimes abbreviated HA) is an axiomatization of arithmetic in 
accordance with the philosophy of intuitionism. It is named after Arend Heyting, who first proposed it. 

Heyting arithmetic adopts the axioms of Peano arithmetic (PA), but uses intuitionistic logic as its rules of inference. 
In particular, the law of the excluded middle does not hold in general, though the induction axiom can be used to 
prove many specific cases. For instance, one can prove that Vx, yGN:x = yvx±y is a. theorem (any two natural 
numbers are either equal to each other, or not equal to each other). In fact, since "=" is the only predicate symbol in 
Heyting arithmetic, it then follows that, for any quantifier-free formula p, V x, y, z, ■ ■ ■ £ N : p v -ip is a theorem 
(where x,y,z. ■ ■ are the free variables in p). 

Kurt Godel studied the relationship between Heyting arithmetic and Peano arithmetic. He used the Godel-Gentzen 
negative translation to prove in 1933 that if HA is consistent, then PA is also consistent. 

Heyting arithmetic should not be confused with Heyting algebras, which are the intuitionistic analogue of Boolean 
algebras. 

External links 

• Stanford Encyclopedia of Philosophy: "Intuitionistic Number Theory by Joan Moschovakis. 
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Symbolic logic 

Symbolic logic may refer to: 

• First-order logic, a system of formal logic 

• Mathematical logic, a field of mathematics 
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Metatheory 

A metatheory or meta-theory is a theory whose subject matter is some other theory. In other words it is a theory 
about a theory. Statements made in the metatheory about the theory are called metatheorems. 

According to the systemic TOGA meta-theory ^ , a meta-theory may refer to the specific point of view on a theory 
and to its subjective meta-properties, but not to its application domain. In the above sense, a theory T of the domain 
D is a meta-theory if D is a theory or a set of theories. A general theory is not a meta-theory because its domain D 
are not theories. 

The following is an example of a meta-theoretical statement: 

Any physical theory is always provisional, in the sense that it is only a hypothesis; you can never prove it. No matter how many times the 

■ ■ 

results of experiments agree with some theory, you can never be sure that the next time the result will not contradict the theory. On the other 
hand, you can disprove a theory by finding even a single observation that disagrees with the predictions of the theory. 

Meta-theory belongs to the philosophical specialty of epistemology and metamathematics, as well as being an object 
of concern to the area in which the individual theory is conceived. An emerging domain of meta-theories is 
systemics. 

Taxonomy 

Examining groups of related theories, a first finding may be to identify classes of theories, thus specifying a 
taxonomy of theories. A proof engendered by a metatheory is called a metatheorem. 

History 

The concept burst upon the scene of twentieth-century philosophy as a result of the work of the German 
mathematician David Hilbert, who in 1905 published a proposal for proof of the consistency of mathematics, 
creating the field of metamathematics. His hopes for the success of this proof were dashed by the work of Kurt 
Godel who in 1931 proved this to be unattainable by his incompleteness theorems. Nevertheless, his program of 
unsolved mathematical problems, out of which grew this metamathematical proposal, continued to influence the 
direction of mathematics for the rest of the twentieth century. 

The study of metatheory became widespread during the rest of that century by its application in other fields, notably 
scientific linguistics and its concept of metalanguage. 

See also 

• meta- 

• meta-knowledge 

• Metalogic 

• Metamathematics 

• Philosophy of social science 
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Metalogic 

Metalogic is the study of the metatheory of logic. While logic is the study of the manner in which logical systems 
can be used to decide the correctness of arguments, metalogic studies the properties of the logical systems 
themselves.^ According to Geoffrey Hunter, while logic concerns itself with the "truths of logic," metalogic 
concerns itself with the theory of "sentences used to express truths of logic." 

The basic objects of study in metalogic are formal languages, formal systems, and their interpretations. The study of 
interpretation of formal systems is the branch of mathematical logic known as model theory, while the study of 
deductive apparatus is the branch known as proof theory. 

History 

Metalogical questions have been asked since the time of Aristotle. However, it was only with the rise of formal 
languages in the late 19th and early 20th century that investigations into the foundations of logic began to flourish. In 
1904, David Hilbert observed that in investigating the foundations of mathematics that logical notions are 
presupposed, and therefore a simultaneous account of metalogical and metamathematical principles was required. 
Today, metalogic and metamathematics are largely synonymous with each other, and both have been substantially 
subsumed by mathematical logic in academia. 

Important distinctions in metalogic 
Metalanguage-Object language 

In metalogic, formal languages are sometimes called object languages. The language used to make statements about 
an object language is called a metalanguage. This distinction is a key difference between logic and metalogic. While 
logic deals with proofs in a formal system, expressed in some formal language, metalogic deals with proofs about a 
formal system which are expressed in a metalanguage about some object language. 

Syntax-semantics 

In metalogic, 'syntax' has to do with formal languages or formal systems without regard to any interpretation of 
them, whereas, 'semantics' has to do with interpretations of formal languages. The term 'syntactic' has a slightly 
wider scope than 'proof-theoretic', since it may be applied to properties of formal languages without any deductive 
systems, as well as to formal systems. 'Semantic' is synonymous with 'model-theoretic'. 
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Use-mention 

In metalogic, the words 'use' and 'mention', in both their noun and verb forms, take on a technical sense in order to 
identify an important distinction. 1 The use— mention distinction (sometimes referred to as the words -as -words 
distinction) is the distinction between using a word (or phrase) and mentioning it. Usually it is indicated that an 
expression is being mentioned rather than used by enclosing it in quotation marks, printing it in italics, or setting the 
expression by itself on a line. The enclosing in quotes of an expression gives us the name of an expression, for 
example: 

'Metalogic' is the name of this article. 
This article is about metalogic. 

Type-token 

The type-token distinction is a distinction in metalogic, that separates an abstract concept from the objects which are 
particular instances of the concept. For example, the particular bicycle in your garage is a token of the type of thing 
known as "The bicycle." Whereas, the bicycle in your garage is in a particular place at a particular time, that is not 
true of "the bicycle" as used in the sentence: "The bicycle has become more popular recently." This distinction is 
used to clarify the meaning of symbols of formal languages. 

Overview 
Formal language 

A formal language is an organized set of symbols the essential feature of which is that it can be precisely defined in 
terms of just the shapes and locations of those symbols. Such a language can be defined, then, without any reference 
to any meanings of any of its expressions; it can exist before any interpretation is assigned to it — that is, before it has 
any meaning. First order logic is expressed in some formal language. A formal grammar determines which symbols 
and sets of symbols are formulas in a formal language. 

A formal language can be defined formally as a set A of strings (finite sequences) on a fixed alphabet a. Some 
authors, including Carnap, define the language as the ordered pair <a, A>. Carnap also requires that each element 
of a must occur in at least one string in A. 

Formation rules 

Formation rules (also called formal grammar) are a precise description of the well-formed formulas of a formal 
language. It is synonymous with the set of strings over the alphabet of the formal language which constitute well 
formed formulas. However, it does not describe their semantics (i.e. what they mean). 

Formal systems 

A formal system (also called a logical calculus, or a logical system) consists of a formal language together with a 
deductive apparatus (also called a deductive system). The deductive apparatus may consist of a set of transformation 
rules (also called inference rules) or a set of axioms, or have both. A formal system is used to derive one expression 
from one or more other expressions. 

A formal system can be formally defined as an ordered triple <a, X » T) d>, where J) d is the relation of direct 
derivability. This relation is understood in a comprehensive sense such that the primitive sentences of the formal 
system are taken as directly derivable from the empty set of sentences. Direct derivability is a relation between a 
sentence and a finite, possibly empty set of sentences. Axioms are laid down in such a way that every first place 
member of J) d is a member of X an d every second place member is a finite subset of X • 
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It is also possible to define a formal system using only the relation £) d. In this way we can omit X » an d a in the 
definitions of interpreted formal language, and interpreted formal system. However, this method can be more 
difficult to understand and work with. 1 

Formal proofs 

A formal proof is a sequence of well-formed formulas of a formal language, the last one of which is a theorem of a 
formal system. The theorem is a syntactic consequence of all the well formed formulae preceding it in the proof. For 
a well formed formula to qualify as part of a proof, it must be the result of applying a rule of the deductive apparatus 
of some formal system to the previous well formed formulae in the proof sequence. 

Interpretations 

An interpretation of a formal system is the assignment of meanings, to the symbols, and truth- values to the sentences 
of the formal system. The study of interpretations is called Formal semantics. Giving an interpretation is 
synonymous with constructing a model 

Results in metalogic 

Results in metalogic consist of such things as formal proofs demonstrating the consistency, completeness, and 
decidability of particular formal systems. 

Major results in metalogic include: 

• Proof of the uncountability of the set of all subsets of the set of natural numbers (Cantor's theorem 1891) 

• Lowenheim- Skolem theorem (Leopold Lowenheim 1915 and Thoralf Skolem 1919) 

• Proof of the consistency of truth- functional propositional logic (Emil Post 1920) 

• Proof of the semantic completeness of truth-functional propositional logic (Paul Bernays 1918), ^ (Emil Post 
1920) [2] 

• Proof of the syntactic completeness of truth- functional propositional logic (Emil Post 1920) 

T21 

• Proof of the decidability of truth-functional propositional logic (Emil Post 1920) 

• Proof of the consistency of first order monadic predicate logic (Leopold Lowenheim 1915) 

• Proof of the semantic completeness of first order monadic predicate logic (Leopold Lowenheim 1915) 

• Proof of the decidability of first order monadic predicate logic (Leopold Lowenheim 1915) 

• Proof of the consistency of first order predicate logic (David Hilbert and Wilhelm Ackermann 1928) 

• Proof of the semantic completeness of first order predicate logic (Godel's completeness theorem 1930) 

• Proof of the undecidability of first order predicate logic (Church's theorem 1936) 

• Godel's first incompleteness theorem 1931 

• Godel's second incompleteness theorem 1931 

See also 

• Metamathematics 
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Signature 

Notes 

Harald Bohr is his younger brother, and Aage Bohr is his son. 

Niels Henrik David Bohr (Danish pronunciation: [nels "boB 9 ]; 7 October 1885 - 18 November 1962) was a Danish 
physicist who made fundamental contributions to understanding atomic structure and quantum mechanics, for which 
he received the Nobel Prize in Physics in 1922. Bohr mentored and collaborated with many of the top physicists of 
the century at his institute in Copenhagen. He was part of a team of physicists working on the Manhattan Project. 
Bohr married Margrethe N0rlund in 1912, and one of their sons, Aage Bohr, grew up to be an important physicist 
who in 1975 also received the Nobel prize. Bohr has been described as one of the most influential scientists of the 
20thcentury. [1] 

Biography 
Early years 

Bohr was born in Copenhagen, Denmark, in 1885. His father, Christian Bohr, a devout Lutheran, was professor of 
physiology at the University of Copenhagen (it is his name which is given to the Bohr shift or Bohr effect), while his 
mother, Ellen Adler Bohr, came from a wealthy Jewish family prominent in Danish banking and parliamentary 
circles. His brother was Harald Bohr, a mathematician and Olympic footballer who played on the Danish national 
team. Niels Bohr was a passionate footballer as well, and the two brothers played a number of matches for the 
Copenhagen-based Akademisk Boldklub, with Niels in goal. There is, however, no truth in the oft-repeated claim 
that Niels Bohr emulated his brother Harald by playing for the Danish national team. 

In 1903 Bohr enrolled as an undergraduate at Copenhagen University, initially studying philosophy and 
mathematics. In 1905, prompted by a gold medal competition sponsored by the Royal Danish Academy of Sciences 
and Letters, he conducted a series of experiments to examine the properties of surface tension, using his father's 
laboratory in the university, familiar to him from assisting there since childhood. His essay won the prize, and it was 
this success that decided Bohr to abandon philosophy and adopt physics. As a student under Christian Christiansen 
he received his doctorate in 1911. As a post-doctoral student, Bohr first conducted experiments under J. J. Thomson 
at Trinity College, Cambridge. In 1912 he joined Ernest Rutherford at Manchester University and he adapted 
Rutherford's nuclear structure to Max Planck's quantum theory and so obtained a theory of atomic structure which, 
with later improvements, mainly as a result of Heisenberg's concepts, remains valid to this day. On the basis of 
Rutherford's theories, Bohr published his model of atomic structure in 1913, introducing the theory of electrons 
traveling in orbits around the atom's nucleus, the chemical properties of the element being largely determined by the 
number of electrons in the outer orbits. Bohr introduced the idea that an electron could drop from a higher-energy 
orbit to a lower one, emitting a photon (light quantum) of discrete energy. This became a basis for quantum theory. 
After four productive years with Ernest Rutherford in Manchester, Bohr returned to Denmark becoming in 1918 
director of the newly created Institute of Theoretical Physics. 

Niels Bohr and his wife Margrethe N0rlund Bohr had six sons. Their oldest died in a tragic boating accident and 
another died from childhood meningitis. The others went on to lead successful lives, including Aage Bohr, who 
became a very successful physicist and, like his father, won a Nobel Prize in physics, in 1975. 



Niels Bohr 



508 



Physics 

In 1916, Niels Bohr became a professor at the University of Copenhagen. With the assistance of the Danish 
government and the Carlsberg Foundation, he succeeded in founding the Institute of Theoretical Physics in 1921, of 
which he became director J 4 ^ In 1922, Bohr was awarded the Nobel Prize in physics "for his services in the 
investigation of the structure of atoms and of the radiation emanating from them." Bohr's institute served as a focal 
point for theoretical physicists in the 1920s and '30s, and most of the world's best known theoretical physicists of that 
period spent some time there. 

Bohr also conceived the principle of complementarity: that items could be 
separately analyzed as having several contradictory properties. For example, 
physicists currently conclude that light behaves either as a wave or a stream of 
particles depending on the experimental framework — two apparently mutually 
exclusive properties — on the basis of this principle. Bohr found philosophical 
applications for this daringly original principle. Albert Einstein much preferred 
the determinism of classical physics over the probabilistic new quantum physics 
(to which Max Planck and Einstein himself had contributed). He and Bohr had 
good-natured arguments over the truth of this principle throughout their lives 
(see Bohr-Einstein debates). 




Niels Bohr as a young man. Exact 
date of photo unknown. 




Niels Bohr and Albert Einstein 
debating quantum theory at Paul 
Ehrenfest's home in Leiden 
(December 1925). 



Werner Heisenberg worked as an assistant to Bohr and university lecturer in 
Copenhagen from 1926 to 1927. It was in Copenhagen, in 1927, that Heisenberg 
developed his uncertainty principle, while working on the mathematical 
foundations of quantum mechanics. Heisenberg was later to be head of the 
German atomic bomb project. In 1941, during the German occupation of 
Denmark in World War II, Bohr was visited by Heisenberg in Copenhagen (see 
section below). In 1943, shortly before he was to be arrested by the German 
police, Bohr escaped to Sweden, and then traveled to London. 

Atomic research 

Niels Bohr worked at the top-secret Los Alamos laboratory in New Mexico, 
U.S., on the Manhattan Project, where he was known by the assumed name of 
Nicholas Baker for security reasons His role in the project was important as he 
was a knowledgeable consultant or "father confessor" on the project. He was 
concerned about a nuclear arms race, and is quoted as saying, "That is why I 
went to America. They didn't need my help in making the atom bomb.' 



,[6] 



Bohr believed that atomic secrets should be shared by the international scientific 
community. After meeting with Bohr, J. Robert Oppenheimer suggested Bohr 
visit President Franklin D. Roosevelt to convince him that the Manhattan Project 
should be shared with the Russians in the hope of speeding up its results. 
Roosevelt suggested Bohr return to the United Kingdom to try to win British approval. Winston Churchill disagreed 
with the idea of openness towards the Russians to the point that he wrote in a letter: "It seems to me Bohr ought to be 
confined or at any rate made to see that he is very near the edge of mortal crimes." 



[7] 
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After the war Bohr returned to Copenhagen, advocating the peaceful use of 
nuclear energy. When awarded the Order of the Elephant by the Danish 
government, he designed his own coat of arms which featured a taijitu 
(symbol of yin and yang) and the Latin motto contraria sunt complementa: 
opposites are complementary. He died in Copenhagen in 1962 of heart 
failure He is buried in the Assistens Kirkegard in the N0rrebro section of 
Copenhagen. 

Contributions to Physics and Chemistry 

• The Bohr model of the atom, the theory that electrons travel in discrete 
orbits around the atom's nucleus. 

• The shell model of the atom, where the chemical properties of an element 
are determined by the electrons in the outermost orbit. 

• The correspondence principle, the basic tool of Old quantum theory. 

• The liquid drop model of the atomic nucleus. 

235 r 101 

• Identified the isotope of uranium that was responsible for slow-neutron fission - U. 

• Much work on the Copenhagen interpretation of quantum mechanics. 

• The principle of complementarity: that items could be separately analyzed as having several contradictory 
properties. 




Coat of arms 



Kierkegaard's influence on Bohr 

It is generally accepted that Bohr read the 19th century Danish philosopher S0ren Kierkegaard. Richard Rhodes 
argues in The Making of the Atomic Bomb that Bohr was influenced by Kierkegaard via the philosopher Harald 
H0ffding, who was strongly influenced by Kierkegaard and who was an old friend of Bohr's father. In 1909, Bohr 
sent his brother Kierkegaard's Stages on Life's Way as a birthday gift. In the enclosed letter, Bohr wrote, "It is the 
only thing I have to send home; but I do not believe that it would be very easy to find anything better.... I even think 
it is one of the most delightful things I have ever read." Bohr enjoyed Kierkegaard's language and literary style, but 
mentioned that he had some "disagreement with [Kierkegaard's ideas]. "^^ 

Given this, there has been some dispute over whether Kierkegaard influenced Bohr's philosophy and science. David 
ri2i 

Favrholdt argues that Kierkegaard had minimal influence over Bohr's work; taking Bohr's statement about 

ri3i 

disagreeing with Kierkegaard at face value, while Jan Faye endorses the opposing point of view by arguing that 
one can disagree with the content of a theory while accepting its general premises and structure J 14 ^ 



Relationship with Heisenberg 

Bohr and Werner Heisenberg enjoyed a strong mentor/protege relationship up to the onset of World War II. 
Heisenberg had made Bohr aware of his talent during a lecture in 1922 in Gottingen. During the mid- 1920s, 
Heisenberg worked with Bohr at the institute in Copenhagen. Heisenberg, like most of Bohr's assistants, learned 
Danish. Heisenberg's uncertainty principle was developed during this period, as was Bohr's complementarity 
principle. 

By the time of World War II, the relationship became strained; this was in part because Bohr, with his partially 
Jewish heritage, remained in occupied Denmark, while Heisenberg remained in Germany and became head of the 
German nuclear effort. Heisenberg made a famous visit to Bohr in September 1941 and during a private moment it 
seems that he began to address nuclear energy and morality as well as the war. Neither Bohr nor Heisenberg spoke 
about it in any detail or left written records of this part of the meeting and they were alone and outside J 15 ^ Bohr 
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seems to have reacted by terminating that conversation abruptly while not giving Heisenberg hints in any direction. 

While some suggest that the relationship became strained at this meeting, other evidence shows that the level of 

contact had been reduced considerably for some time already. Heisenberg suggested that the fracture occurred later. 

In correspondence to his wife, Heisenberg described the final visit of the trip: "Today I was once more, with 

Weizsaecker, at Bohr's. In many ways this was especially nice, the conversation revolved for a large part of the 

evening around purely human concerns, Bohr was reading aloud, I played a Mozart Sonata (A-Major)."^ Ivan 

Supek, one of Heisenberg's students and friends, claimed that the main figure of the meeting was actually 

ri7i 

Weizsacker who tried to persuade Bohr to mediate peace between Great Britain and Germany. 
Tube Alloys 

"Tube Alloys" was the code-name for the British nuclear weapon program. British intelligence inquired about Bohr's 
availability for work or insights of particular value. Bohr's reply made it clear that he could not help. This reply, like 
his reaction to Heisenberg, made sure that if Gestapo intercepted anything attributed to Bohr it would point to no 
knowledge regarding nuclear energy as it stood in 1941. This does not exclude the possibility that Bohr privately 
made calculations going further than his work in 1939 with Wheeler. 

After leaving Denmark in the dramatic day and night (October 1943) when most Jews were able to escape to Sweden 
due to exceptional circumstances (see Rescue of the Danish Jews), Bohr was quickly asked again to join the British 
effort and he was flown to the UK. He was evacuated from Stockholm in 1943 in an unarmed De Havilland 
Mosquito operated by British Overseas Airways Corporation (BOAC). Passengers on BOAC's Mosquitos were 
carried in an improvised cabin in the bomb bay. The flight almost ended in tragedy as Bohr did not don his oxygen 
equipment as instructed and passed out at high altitude. He would have died had not the pilot surmising from Bohr's 
lack of response to intercom communication that he had lost consciousness, descended to a lower altitude for the 
remainder of the flight. Bohr's comment was that he had slept like a baby for the entire flight. 

As part of the UK team on "Tube Alloys" Bohr went to Los Alamos. Oppenheimer credited Bohr warmly for his 
guiding help during certain discussions among scientists there. Discreetly, he met President Franklin D. Roosevelt 
and later Winston Churchill to warn against the perilous perspectives that would follow from separate development 
of nuclear weapons by several powers rather than some form of controlled sharing of the knowledge, which would 
spread quickly in any case. Only in the 1950s, after the Soviet Union's first nuclear weapon test, was it possible to 
create the International Atomic Energy Agency along the lines of Bohr's suggestion. 

Speculation 

In 1957, while the author Robert Jungk was working on the book Brighter Than a Thousand Suns, Heisenberg wrote 

to Jungk explaining that he had visited Copenhagen to communicate to Bohr his view that scientists on either side 

should help prevent development of the atomic bomb, that the German attempts were entirely focused on energy 

ri8i 

production and that Heisenberg's circle of colleagues tried to keep it that way. Heisenberg acknowledged that his 
cryptic approach of the subject had so alarmed Bohr that the discussion failed. Heisenberg nuanced his claims and 
avoided the implication that he and his colleagues had sabotaged the bomb effort; this nuance was lost in Jungk's 
original publication of the book, which implied that the German atomic bomb project was obstructed by Heisenberg. 

When Bohr saw Jungk's erroneous depiction in the Danish translation of the book, he disagreed. He drafted (but 

never sent) a letter to Heisenberg, stating that while Heisenberg had indeed discussed the subject of nuclear weapons 

in Copenhagen, Heisenberg had never alluded to the fact that he might be resisting efforts to build such weapons. 

ri9i 

Bohr dismissed the idea of any pact as hindsight. 

Michael Frayn's play Copenhagen, which was performed in London (for five years), Copenhagen, Gothenburg, 
Rome, Athens, Geneva and on Broadway in New York, explores what might have happened at the 1941 meeting 
between Heisenberg and Bohr. Frayn points in particular to the onus of being one of the few to understand what it 
would mean to create a nuclear weapon. 
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Open World 

Bohr advocated informing the Soviet authorities that the atomic bomb would soon be in use. In 1944 he obtained an 
audience with Winston Churchill, who became worried about whether Bohr was a security riskJ 20 ^ In 1950 he 
addressed an 'Open Letter' to the United Nations. 



Legacy 

T221 

• He was one of the founding fathers of CERN in 1954. 

• Received the first ever Atoms for Peace Award in 1957. 

• In 1965, three years after Bohr's death, the Institute of Physics at the University of Copenhagen changed its name 
to the Niels Bohr Institute. 

• The Bohr models semicentennial was commemorated in Denmark on 21 November 1963 with a postage stamp 
depicting Bohr, the hydrogen atom and the formula for the difference of any two hydrogen energy levels: 

hv = e 2 — €i • 

• Bohrium (a chemical element, atomic number 107) is named in honour of Niels Bohr. 

• Hafnium, another chemical element, whose properties were predicted by Niels Bohr, was named by him after 
Hafnia, Copenhagen's Latin name. 

• Asteroid 3948 Bohr is named after him. 

• The Centennial of Bohr's birth was commemorated in Denmark on 3 October 1985 with a postage stamp 
depicting Bohr with his wife Margrethe. 

• In 1997 the Danish National Bank started circulating the 500-krone banknote with the portrait of Niels Bohr 

i • • [23] [24] 

smoking a pipe. 

[25] 

• Bohr has been a common name in Europe since the Middle Ages. It remains fairly common in Europe and 
spread to the U.S. with pilgrims named Bohr settling there. There was an notable increase in the middle name 
Bohr throughout Europe and America following Niels Bohr's death. 
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Notes 

He is the father of Erwin Planck who was executed in 1945 by the Gestapo for his part in the July 20 plot. 



Max Planck (April 23, 1858 - October 4, 1947) was a German physicist. He is considered to be the founder of the 
quantum theory, and thus one of the most important physicists of the twentieth century. Planck was awarded the 
Nobel Prize in Physics in 1918. 
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Life and career 



Planck came from a traditional, intellectual family. His paternal great-grandfather and grandfather were both 
theology professors in Gottingen; his father was a law professor in Kiel and Munich; and his paternal uncle was a 
judge. 

Planck was born in Kiel, Holstein, to Johann Julius Wilhelm Planck 
and his second wife, Emma Patzig. He was baptised with the name of 
Karl Ernst Ludwig Marx Planck, of his given names, Marx (a now 
obsolete variant of Markus or maybe simply an error for Max, which is 



/*^^*-* fl^*^^ t^u^^u 

Max Planck's signature at ten years of age. 



[1] 



actually short for Maximilian) was indicated as the primary name. 
However, by the age of ten he signed with the name Max and used this 



for the rest of his life. 



[2] 



He was the sixth child in the family, though two of his siblings were 
from his father's first marriage. Among his earliest memories was the 
marching of Prussian and Austrian troops into Kiel during the Danish-Prussian war of 1864. In 1867 the family 
moved to Munich, and Planck enrolled in the Maximilians gymnasium school, where he came under the tutelage of 
Hermann Muller, a mathematician who took an interest in the youth, and taught him astronomy and mechanics as 
well as mathematics. It was from Muller that Planck first learned the principle of conservation of energy. Planck 
graduated early, at age 17. This is how Planck first came in contact with the field of physics. 

Planck was gifted when it came to music. He took singing lessons and played piano, organ and cello, and composed 
songs and operas. However, instead of music he chose to study physics. 

The Munich physics professor Philipp von Jolly advised Planck against 
going into physics, saying, "in this field, almost everything is already 
discovered, and all that remains is to fill a few holes." Planck replied 
that he did not wish to discover new things, only to understand the 
known fundamentals of the field, and began his studies in 1874 at the 
University of Munich. Under Jolly's supervision, Planck performed the 
only experiments of his scientific career, studying the diffusion of 
hydrogen through heated platinum, but transferred to theoretical 
physics. 

In 1877 he went to Berlin for a year of study with physicists Hermann 
von Helmholtz and Gustav Kirchhoff and the mathematician Karl 
Weierstrass. He wrote that Helmholtz was never quite prepared, spoke 
slowly, miscalculated endlessly, and bored his listeners, while 
Kirchhoff spoke in carefully prepared lectures which were dry and 
monotonous. He soon became close friends with Helmholtz. While 
there he undertook a program of mostly self-study of Clausius's 
writings, which led him to choose heat theory as his field. 




Planck as a young man, 1878 



In October 1878 Planck passed his qualifying exams and in February 
1879 defended his dissertation, Uber den zweiten Hauptsatz der mechanischen Warmetheorie (On the second law of 
thermodynamics). He briefly taught mathematics and physics at his former school in Munich. 

In June 1880 he presented his habilitation thesis, Gleichgewichtszustande isotroper Korper in verschiedenen 
Temperaturen (Equilibrium states of isotropic bodies at different temperatures). 
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Academic career 

With the completion of his habilitation thesis, Planck became an unpaid private lecturer in Munich, waiting until he 
was offered an academic position. Although he was initially ignored by the academic community, he furthered his 
work on the field of heat theory and discovered one after another the same thermodynamical formalism as Gibbs 
without realizing it. Clausius's ideas on entropy occupied a central role in his work. 

In April 1885 the University of Kiel appointed Planck as associate professor of theoretical physics. Further work on 
entropy and its treatment, especially as applied in physical chemistry, followed. He proposed a thermodynamic basis 
for Svante Arrhenius's theory of electrolytic dissociation. 

Within four years he was named the successor to Kirchhoff s position at the University of Berlin — presumably 
thanks to Helmholtz's intercession — and by 1892 became a full professor. In 1907 Planck was offered Boltzmann's 
position in Vienna, but turned it down to stay in Berlin. During 1909, as University of Berlin professor, eight of his 
lectures were used by the Ernest Kempton Adams Fund for Physical Research in Theoretical Physics at Columbia 
University in New York City for a series of lectures translated by Columbia University professor A. P. Wills. ^ He 
retired from Berlin on January 10, 1926, and was succeeded by Erwin Schrodinger. 

Family 

In March 1887 Planck married Marie Merck (1861—1909), sister of a school fellow, and moved with her into a sublet 
apartment in Kiel. They had four children: Karl (1888-1916), the twins Emma (1889-1919) and Grete (1889-1917), 
and Erwin (1893-1945). 

After the apartment in Berlin, the Planck family lived in a villa in Berlin- Grunewald, WangenheimstraBe 21. Several 
other professors of Berlin University lived nearby, among them theologian Adolf von Harnack, who became a close 
friend of Planck. Soon the Planck home became a social and cultural centre. Numerous well-known scientists, such 
as Albert Einstein, Otto Hahn and Lise Meitner were frequent visitors. The tradition of jointly performing music had 
already been established in the home of Helmholtz. 

After several happy years, in July 1909 Marie Planck died, possibly from tuberculosis. In March 1911 Planck 
married his second wife, Marga von Hoesslin (1882-1948); in December his third son Hermann was born. 

During the First World War Planck's second son Erwin was taken prisoner by the French in 1914, while his oldest 
son Karl was killed in action at Verdun. Grete died in 1917 while giving birth to her first child. Her sister died the 
same way two years later, after having married Grete' s widower. Both granddaughters survived and were named 
after their mothers. Planck endured these losses stoically. 

In January 1945, Erwin, to whom he had been particularly close, was sentenced to death by the Nazi 
Volksgerichtshof because of his participation in the failed attempt to assassinate Hitler in July 1944. Erwin was 
executed on 23 January 1945.^ 

• Wives: Marie Merck (m. 1887), Marga von Hoesslin (m. 1910) 

• Children: Karl (1888-1916), twins Emma (1889-1919) and Grete (1889-1917), Erwin (1893-1945), Hermann 
(1911-1954) 

Professor at Berlin University 

In Berlin, Planck joined the local Physical Society. He later wrote about this time: "In those days I was essentially 
the only theoretical physicist there, whence things were not so easy for me, because I started mentioning entropy, but 
this was not quite fashionable, since it was regarded as a mathematical spook". Thanks to his initiative, the various 
local Physical Societies of Germany merged in 1898 to form the German Physical Society (Deutsche Physikalische 
Gesellschaft, DPG); from 1905 to 1909 Planck was the president. 

Planck started a six-semester course of lectures on theoretical physics, "dry, somewhat impersonal" according to Lise 
Meitner, "using no notes, never making mistakes, never faltering; the best lecturer I ever heard" according to an 
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English participant, James R. Partington, who continues: "There were always many standing around the room. As the 
lecture-room was well heated and rather close, some of the listeners would from time to time drop to the floor, but 
this did not disturb the lecture". Planck did not establish an actual "school"; the number of his graduate students was 
only about 20, among them: 

1897 Max Abraham (1875-1922) 

1904 Moritz Schlick (1882-1936) 

1906 Walther MeiBner (1882-1974) 

1906 Max von Laue (1879-1960) 

1907 Fritz Reiche (1883-1960) 
1912 Walter Schottky (1886-1976) 
1914 Walther Bothe (1891-1957) 

Black-body radiation 

In 1894 Planck turned his attention to the problem of black-body radiation. He had been commissioned by electric 
companies to create maximum light from lightbulbs with minimum energy. The problem had been stated by 
Kirchhoff in 1859: how does the intensity of the electromagnetic radiation emitted by a black body (a perfect 
absorber, also known as a cavity radiator) depend on the frequency of the radiation (i.e., the color of the light) and 
the temperature of the body? The question had been explored experimentally, but no theoretical treatment agreed 
with experimental values. Wilhelm Wien proposed Wien's law, which correctly predicted the behaviour at high 
frequencies, but failed at low frequencies. The Ray leigh- Jeans law, another approach to the problem, created what 
was later known as the "ultraviolet catastrophe", but contrary to many textbooks this was not a motivation for 
Planck. [6] 

Planck's first proposed solution to the problem in 1899 followed from what Planck called the "principle of 
elementary disorder", which allowed him to derive Wien's law from a number of assumptions about the entropy of 
an ideal oscillator, creating what was referred-to as the Wien-Planck law. Soon it was found that experimental 
evidence did not confirm the new law at all, to Planck's frustration. Planck revised his approach, deriving the first 
version of the famous Planck black-body radiation law, which described the experimentally observed black-body 
spectrum well. It was first proposed in a meeting of the DPG on October 19, 1900 and published in 1901. This first 
derivation did not include energy quantisation, and did not use statistical mechanics, to which he held an aversion. In 
November 1900, Planck revised this first approach, relying on Boltzmann's statistical interpretation of the second 
law of thermodynamics as a way of gaining a more fundamental understanding of the principles behind his radiation 
law. As Planck was deeply suspicious of the philosophical and physical implications of such an interpretation of 
Boltzmann's approach, his recourse to them was, as he later put it, "an act of despair ... I was ready to sacrifice any of 
my previous convictions about physics. "^ The central assumption behind his new derivation, presented to the DPG 
on 14 December 1900, was the supposition, now known as the Planck postulate, that electromagnetic energy could 
be emitted only in quantized form, in other words, the energy could only be a multiple of an elementary unit 
E = hv , where h is Planck's constant, also known as Planck's action quantum (introduced already in 1899), and 
V (the Greek letter nu, not the letter v) is the frequency of the radiation. 
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At first Planck considered that quantisation was only "a purely formal 
assumption ... actually I did not think much about it..."; nowadays this 
assumption, incompatible with classical physics, is regarded as the 
birth of quantum physics and the greatest intellectual accomplishment 
of Planck's career (Ludwig Boltzmann had been discussing in a 
theoretical paper in 1877 the possibility that the energy states of a 
physical system could be discrete). Further interpretation of the 
implications of Planck's work was advanced by Albert Einstein in 1905 
in connection with his work on the photoelectric effect — for this 
reason, the philosopher and historian of science Thomas Kuhn argued 
that Einstein should be given credit for quantum theory more so than 
Planck, since Planck did not understand in a deep sense that he was 
"introducing the quantum" as a real physical entity. Be that as it may, 
it was in recognition of Planck's monumental accomplishment that he 
was awarded the Nobel Prize in Physics in 1918. 

The discovery of Planck's constant enabled him to define a new 
universal set of physical units (such as the Planck length and the 
Planck mass), all based on fundamental physical constants. 

Subsequently, Planck tried to grasp the meaning of energy quanta, but to no avail. "My unavailing attempts to 
somehow reintegrate the action quantum into classical theory extended over several years and caused me much 
trouble." Even several years later, other physicists like Rayleigh, Jeans, and Lorentz set Planck's constant to zero in 
order to align with classical physics, but Planck knew well that this constant had a precise nonzero value. "I am 
unable to understand Jeans' stubbornness — he is an example of a theoretician as should never be existing, the same 
as Hegel was for philosophy. So much the worse for the facts, if they are wrong." 

Max Born wrote about Planck: "He was by nature and by the tradition of his family conservative, averse to 
revolutionary novelties and skeptical towards speculations. But his belief in the imperative power of logical thinking 
based on facts was so strong that he did not hesitate to express a claim contradicting to all tradition, because he had 
convinced himself that no other resort was possible." 




Planck in 1918, the year he received the Nobel 
Prize in Physics for his work on quantum theory 



Einstein and the theory of relativity 

In 1905 the three epochal papers of the hitherto completely unknown Albert Einstein were published in the journal 
Annalen der Physik. Planck was among the few who immediately recognized the significance of the special theory of 
relativity. Thanks to his influence this theory was soon widely accepted in Germany. Planck also contributed 
considerably to extend the special theory of relativity. 

Einstein's hypothesis of light quanta (photons), based on Philipp Lenard's 1902 discovery of the photoelectric effect, 
was initially rejected by Planck. He was unwilling to discard completely Maxwell's theory of electrodynamics. "The 
theory of light would be thrown back not by decades, but by centuries, into the age when Christian Huygens dared to 
fight against the mighty emission theory of Isaac Newton ..." 

In 1910 Einstein pointed out the anomalous behavior of specific heat at low temperatures as another example of a 
phenomenon which defies explanation by classical physics. Planck and Nernst, seeking to clarify the increasing 
number of contradictions, organized the First Solvay Conference (Brussels 1911). At this meeting Einstein was able 
to convince Planck. 

Meanwhile Planck had been appointed dean of Berlin University, whereby it was possible for him to call Einstein to 
Berlin and establish a new professorship for him (1914). Soon the two scientists became close friends and met 
frequently to play music together. 
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World War I 

At the onset of the First World War Planck endorsed the general excitement of the public, writing that, "... besides of 
much horrible also much unexpectedly great and beautiful: the swift solution of the most difficult issues of domestic 
policy through arrangement of all parties... the higher esteem for all that is brave and truthful..." 

Nonetheless, Planck refrained from the extremes of nationalism. In 1915, at a time when Italy was about to join the 
Allied Powers, he voted successfully for a scientific paper from Italy, which received a prize from the Prussian 
Academy of Sciences, where Planck was one of four permanent presidents. 

Planck also signed the infamous "Manifesto of the 93 intellectuals", a pamphlet of polemic war propaganda (while 
Einstein retained a strictly pacifistic attitude which almost led to his imprisonment, being spared by his Swiss 
citizenship). But in 1915 Planck, after several meetings with Dutch physicist Lorentz, he revoked parts of the 
Manifesto. Then in 1916 he signed a declaration against German annexationism. 

Post War and Weimar Republic 

In the turbulent post-war years, Planck, now the highest authority of German physics, issued the slogan "persevere 
and continue working" to his colleagues. 

In October 1920 he and Fritz Haber established the Notgemeinschaft der Deutschen Wissenschaft (Emergency 
Organization of German Science), aimed at providing financial support for scientific research. A considerable 
portion of the monies the organization would distribute were raised abroad. 

Planck also held leading positions at Berlin University, the Prussian Academy of Sciences, the German Physical 
Society and the Kaiser- Wilhelm-Gesellschaft (which in 1948 became the Max-Planck-Gesellschaft). During this 
time economic conditions in Germany were such that he was hardly able to conduct research. 

During the interwar period, Planck became a member of the Deutsche Volks-Partei (German People's Party), the 
party of Nobel Peace Prize laureate Gustav Stresemann, which aspired to liberal aims for domestic policy and rather 
revisionistic aims for international politics. 

Planck disagreed with the introduction of universal suffrage and later expressed the view that the Nazi dictatorship 
resulted from "the ascent of the rule of the crowds". 

Quantum mechanics 

At the end of the 1920s Bohr, Heisenberg and Pauli had worked out the Copenhagen interpretation of quantum 
mechanics, but it was rejected by Planck, as well as Schrodinger, Laue, and Einstein. Planck expected that wave 
mechanics would soon render quantum theory — his own child — unnecessary. This was not to be the case, however. 
Further work only cemented quantum theory, even against his and Einstein's philosophical revulsions. Planck 
experienced the truth of his own earlier observation from his struggle with the older views in his younger years: "A 
new scientific truth does not triumph by convincing its opponents and making them see the light, but rather because 
its opponents eventually die, and a new generation grows up that is familiar with it." 

Nazi dictatorship and The Second World War 

When the Nazis seized power in 1933, Planck was 74. He witnessed many Jewish friends and colleagues expelled 
from their positions and humiliated, and hundreds of scientists emigrated from Germany. Again he tried the 
"persevere and continue working" slogan and asked scientists who were considering emigration to remain in 
Germany. He hoped the crisis would abate soon and the political situation would improve. There was also a deeper 
argument against emigration. Emigrating German non- Jewish scientists would need to look for academic positions 
abroad, but these positions better served Jewish scientists, who had no chance of continuing to work in Germany. 

Hahn asked Planck to gather well-known German professors in order to issue a public proclamation against the 
treatment of Jewish professors, but Planck replied, "If you are able to gather today 30 such gentlemen, then 
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tomorrow 150 others will come and speak against it, because they are eager to take over the positions of the 

rm 

others.' Under Planck's leadership, the Kaiser- Wilhelm-Gesellschaft (KWG) avoided open conflict with the Nazi 
regime, except concerning Fritz Haber. Planck tried to discuss the issue with Adolf Hitler but was unsuccessful. In 
the following year, 1934, Haber died in exile. 

One year later, Planck, having been the president of the KWG since 1930, organized in a somewhat provocative style 
an official commemorative meeting for Haber. He also succeeded in secretly enabling a number of Jewish scientists 
to continue working in institutes of the KWG for several years. In 1936, his term as president of the KWG ended, 
and the Nazi government pressured him to refrain from seeking another term. 

As the political climate in Germany gradually became more hostile, Johannes Stark, prominent exponent of Deutsche 
Physik ("German Physics", also called "Aryan Physics") attacked Planck, Sommerfeld and Heisenberg for 
continuing to teach the theories of Einstein, calling them "white Jews." The "Hauptamt Wissenschaft" (Nazi 
government office for science) started an investigation of Planck's ancestry, but all they could find out was that he 
was "1/16 Jewish." 

In 1938 Planck celebrated his 80th birthday. The DPG held a celebration, during which the Max-Planck medal 
(founded as the highest medal by the DPG in 1928) was awarded to French physicist Louis de Broglie. At the end of 
1938 the Prussian Academy lost its remaining independence and was taken over by Nazis (Gleichschaltung). Planck 
protested by resigning his presidency. He continued to travel frequently, giving numerous public talks, such as his 
talk on Religion and Science, and five years later he was sufficiently fit to climb 3,000-meter peaks in the Alps. 

During the Second World War, the increasing number of Allied bombing campaigns against Berlin forced Planck 
and his wife to leave the city temporarily and live in the countryside. In 1942 he wrote: "In me an ardent desire has 
grown to persevere this crisis and live long enough to be able to witness the turning point, the beginning of a new 
rise." In February 1944 his home in Berlin was completely destroyed by an air raid, annihilating all his scientific 
records and correspondence. Finally, he got into a dangerous situation in his rural retreat because of the rapid 
advance of the Allied armies from both sides. After the end of the war he was brought to a relative in Gottingen. 

Planck endured many personal tragedies after the age of 50. In 1909, his first wife died after 22 years of marriage, 
leaving him with two sons and twin daughters. Planck's oldest son, Karl, was killed in action in 1916. His daughter 
Margarete died in childbirth in 1917, and another daughter, Emma, married her late sister's husband and then also 
died in childbirth, in 1919. During World War II, Planck's house in Berlin was completely destroyed by bombs in 
1944 and his youngest son, Erwin, was implicated in the attempt made on Hitler's life in the July 20 plot. 
Consequently, Erwin died at the hands of the Gestapo in 1945. Erwin's death destroyed Planck's will to liveJ 10 ^ By 
the end of the war, Planck, his second wife and his son by her, moved to Gottingen where he died on October 4, 
1947. 

Religious view 

Planck was a devoted and persistent adherent of Christianity from early life to death, but he was very tolerant 
towards alternative views and religions, and so was discontented with the Nazi church organizations' demands for 
unquestioning belief. 

The God in which Planck believed was an almighty, all-knowing, benevolent but unintelligible God that permeated 
everything, manifest through symbols, including physical laws. His view may have been motivated by an opposition 
like Einstein's and Schrodinger's against the positivist view. Planck was interested in truth and a Universe beyond 
observation, and objected to atheism as an obsession with symbols. 

Planck regarded the scientist as a man of imagination and faith, "faith" interpreted as being similar to "having a 
working hypothesis". For example the causality principle isn't true or false, it is an act of faith. Thereby Planck may 
have indicated a view that points toward Imre Lakatos' research programs process descriptions, where falsification is 
mostly tolerable, in faith of its future removal P ^ 
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On the other hand, Planck wrote, M ...'to believe' means 'to recognize as a truth,' and the knowledge of nature, 

continually advancing on incontestably safe tracks, has made it utterly impossible for a person possessing some 

training in natural science to recognize as founded on truth the many reports of extraordinary contradicting the laws 

of nature, of miracles which are still commonly regarded as essential supports and confirmations of religious 

ri2i 

doctrines, and which formerly used to be accepted as facts pure and simple, without doubt or criticism." 



Honors and awards 




• "Pour le Merite" for Science and Arts 1915 (in 1930 he became 
chancellor of this order) 

• Nobel Prize in Physics 1918 (awarded 1919) 

• Lorentz Medal 1927 

• Franklin Medal (1927) 

• Adlerschild des Deutschen Reiches (1928), an award from the 
German Reich President 

• Max Planck medal (1929, together with Einstein) 

• Copley Medal (1929) 

• Planck received honorary doctorates from the universities of Frankfurt, Munich (TH), Rostock, Berlin (TH), 
Graz, Athens, Cambridge, London, and Glasgow. 

• The asteroid 1069 was named "Stella Planckia" by the International Astronomical Union (1938) 



The Max Planck two Deutsche Mark coin 



Publications 

• Planck, Max. (1897). Vorlesungen ilber Thermodynamik 
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• Planck, Max. (1900). "Entropy and Temperature of Radiant Heat . " Annalen der Physik, vol. 1. no 4. April, 
pg. 719-37. 



• Planck, Max. (1901). " On the Law of Distribution of Energy in the Normal Spectrum 
vol. 4, p. 553 ff. 
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. Annalen der Physik, 
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Born 


15 August 1892Dieppe, France 


Died 


19 March 1987 (aged 94)Louveciennes, France 


Nationality 


French 


Fields 


Physics 


Institutions 


Sorbonne 
University of Paris 


Alma mater 


Sorbonne 


Doctoral advisor 


Paul Langevin 


Doctoral students 


Jean-Pierre Vigier 
Alexandru Proca 


Known for 


Wave nature of electrons 
de Broglie wavelength 


Notable awards 


Nobel Prize in Physics (1929) 



Louis- Victor-Pierre-Raymond, 7th due de Broglie, FRS (English pronunciation: /de'broi/; French: [d9 btfcej] ( 4)) listen); 

15 August 1892 - 19 March 1987) was a French physicist and a Nobel laureate. He was the sixteenth member 
elected to occupy seat 1 of the Academie francaise in 1944, and served as Perpetual Secretary of the Academie des 
sciences, France. 

Biography 

Louis de Broglie was born to a noble family in Dieppe, Seine-Maritime, younger son of Victor, 5th due de Broglie. 
He became the 7th due de Broglie upon the death without heir in 1960 of his older brother, Maurice, 6th due de 
Broglie, also a physicist. He did not marry. When he died in Louveciennes, he was succeeded as duke by a distant 
cousin, Victor-Francois, 8th due de Broglie. 

De Broglie had originally intended a career in humanities, and received his first degree in history. Afterwards, 
though, he turned his attention toward mathematics and physics and received a degree in physics. With the outbreak 
of the First World War in 1914, he offered his services to the army in the development of radio communications. 

His 1924 doctoral thesis, Recherches sur la theorie des quanta (Research on Quantum Theory), introduced his 
theory of electron waves. This included the wave-particle duality theory of matter, based on the work of Max Planck 
and Albert Einstein on light. The thesis examiners, unsure of the material, passed his thesis to Einstein for evaluation 
who endorsed his wave-particle duality proposal wholeheartedly; de Broglie was awarded his doctorate. This 
research culminated in the de Broglie hypothesis stating that any moving particle or object had an associated wave. 
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De Broglie thus created a new field in physics, the mecanique ondulatoire, or wave mechanics, uniting the physics of 
energy (wave) and matter (particle). For this he won the Nobel Prize in Physics in 1929. 

In his later career, de Broglie worked to develop a causal explanation of wave mechanics, in opposition to the wholly 
probabilistic models which dominate quantum mechanical theory; it was refined by David Bohm in the 1950s. 

In addition to strictly scientific work, de Broglie thought and wrote about the philosophy of science, including the 
value of modern scientific discoveries. 

De Broglie became a member of the Academie des sciences in 1933, and was the academy's perpetual secretary from 
1942. On 12 October 1944, he was elected to the Academie francaise, replacing mathematician Emile Picard. 
Because of the deaths and imprisonments of Academie members during the occupation and other effects of the war, 
the Academie was unable to meet the quorum of twenty members for his election; due to the exceptional 
circumstances, however, his unanimous election by the seventeen members present was accepted. In an event unique 
in the history of the Academie, he was received as a member by his own brother Maurice, who had been elected in 
1934. UNESCO awarded him the first Kalinga Prize in 1952 for his work in popularizing scientific knowledge, and 
he was elected a Foreign Member of the Royal Society on 23 April 1953. In 1961 he received the title of Knight of 
the Grand Cross in the Legion d'honneur. De Broglie was awarded a post as counselor to the French High 
Commission of Atomic Energy in 1945 for his efforts to bring industry and science closer together. He established a 
center for applied mechanics at the Henri Poincare Institute, where research into optics, cybernetics, and atomic 
energy were carried out. He inspired the formation of the International Academy of Quantum Molecular Science and 
was an early member. 

Honours and awards 

• 1929 Nobel Prize in Physics 

• 1929 Henri Poincare Medal 

• 1932 Albert I of Monaco Prize 

• 1938 Max Planck Medal 

• 1938 Fellow, Royal Swedish Academy of Sciences 

• 1944 Fellow, Academie francaise 

• 1952 Kalinga Prize 

• 1953 Fellow, Royal Society 
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• Rapport au 5e Conseil de Physique Solvay. Brussels, 1927. 

• La mecanique ondulatoire (Wave Mechanics). Paris: Gauthier-Villars, 1928. 

• Matiere et lumiere (Matter and Light). Paris: Albin Michel, 1937. 

• Une tentative d' interpretation causale et non lineaire de la mecanique ondulatoire: la theorie de la double 
solution. Paris: Gauthier-Villars, 1956. 

• English translation: Non-linear Wave Mechanics: A Causal Interpretation. Amsterdam: Elsevier, 1960. 

• Sur les sentiers de la science (On the Paths of Science). 

• Introduction a la nouvelle theorie des particules de M. Jean-Pierre Vigier et de ses collaborateurs. Paris: 
Gauthier-Villars, 1961. Paris: Albin Michel, 1960. 

• English translation: Introduction to the Vigier Theory of elementary particles. Amsterdam: Elsevier, 1963. 

• Etude critique des bases de V interpretation actuelle de la mecanique ondulatoire. Paris: Gauthier-Villars, 1963. 
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• English translation: The Current Interpretation of Wave Mechanics: A Critical Study. Amsterdam, Elsevier, 
1964. 

• Certitudes et incertitudes de la science (Certitudes and Incertitudes of Science). Paris: Albin Michel, 1966. 
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30 August 1871Brightwater, New Zealand 


Died 


Ly October Lyj 1 (aged oojCaniDriage, iinglana 


Residence 


New Zealand, UK, Canada 


Citizenship 


United Kingdom 


Nationality 


British-New Zealander 


Fields 


Physicist-Chemist 


Institutions 


McGill University 
University of Manchester 


Alma mater 


University of Canterbury 
Cambridge University 


Academic advisors 


Alexander Bickerton 
J. J. Thomson 


Doctoral students 


Edward Victor Appleton 
Alexander MacAulay 
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Robert William Boyle 
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Other notable students 


Mark Oliphant 




Patrick Blackett 




Hans Geiger 




Niels Bohr 




Otto Hahn 




Teddy Bullard 




Pyotr Kapitsa 




John Cockcroft 




Charles Drummond Ellis 




James Chadwick 




Ernest Marsden 




Edward Andrade 




Frederick Soddy 




Edward Victor Appleton 




Bertram Boltwood 




Kazimierz Fajans 




Charles Galton Darwin 




A. J. B. Robertson 




George Laurence 




Henry DeWolf Smyth 




Harriet Brooks 




Douglas Hartree 




Iven Mackay 




Norman Alexander 


Known for 


Father of nuclear physics 




Rutherford model 




Rutherford scattering 




Rutherford backscattering spectroscopy 




Discovery of proton 




Rutherford (unit) 




Coining the term 'artificial disintegration' 


Influenced 


Henry Moseley 




Hans Geiger 




Albert Beaumont Wood 


Notable awards 


Rumford Medal (1905) 




Nobel Prize in Chemistry (1908) 




Elliott Cresson Medal (1910) 




Matteucci Medal (1913) 




Copley Medal (1922) 




Franklin Medal (1924) 


Signature 





Ernest Rutherford, 1st Baron Rutherford of Nelson OM, FRS (30 August 1871-19 October 1937) was a New 

Zealand-born British chemist and physicist who became known as the father of nuclear physics J 1 -' In early work he 

discovered the concept of radioactive half life, proved that radioactivity involved the transmutation of one chemical 

element to another, and also differentiated and named alpha and beta radiation. This work was done at McGill 

University in Canada. It is the basis for the Nobel Prize in Chemistry he was awarded in 1908 "for his investigations 

T21 

into the disintegration of the elements, and the chemistry of radioactive substances". 

Rutherford performed his most famous work after he had moved to the U.K. in 1907 and was already a Nobel 

T31 

laureate. In 1911, he postulated that atoms have their positive charge concentrated in a very small nucleus, and 
thereby pioneered the Rutherford model, or planetary, model of the atom, through his discovery and interpretation of 
Rutherford scattering in his gold foil experiment. He is widely credited with first "splitting the atom" in 1917. This 
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led to the first experiment to split the nucleus in a controlled manner, performed by two students working under his 
direction, John Cockcroft and Ernest Walton, in 1932. 

Rutherford died in 1937, and was honoured in death by being interred near the greatest scientists of the United 
Kingdom, near Sir Isaac Newton's tomb in Westminster Abbey. The chemical element rutherfordium (element 104) 
was named for him in 1997. 

Biography 

Ernest Rutherford was the son of James Rutherford, a farmer, and his wife Martha Thompson, originally from 
Hornchurch, Essex, England J 4 ^ James had emigrated to New Zealand from Perth, Scotland, "to raise a little flax and 
a lot of children". Ernest was born at Spring Grove (now Brightwater), near Nelson, New Zealand. His first name 
was mistakenly spelled Earnest when his birth was registered 

He studied at Havelock School and then Nelson College and won a scholarship to study at Canterbury College, 
University of New Zealand where he was president of the debating society, among other things. After gaining his 
BA, MA and BSc, and doing two years of research at the forefront of electrical technology, in 1895 Rutherford 
travelled to England for postgraduate study at the Cavendish Laboratory, University of Cambridge (1895-1 898), ^ 
and he briefly held the world record for the distance over which electromagnetic waves could be detected. 

In 1898 Rutherford was appointed to succeed Hugh Longbourne Callendar in the chair of Macdonald Professor of 
physics at McGill University in Montreal, Canada, where he did the work that gained him the Nobel Prize in 
Chemistry in 1908. In 1900 he gained a DSc from the University of New Zealand. Also in 1900 he married Mary 
Georgina Newton (1876—1945); they had one daughter, Eileen Mary (1901-1930), who married Ralph Fowler. In 
1907 Rutherford moved to Britain to take the chair of physics at the University of Manchester. 

Later years and honours 

He was knighted in 1914. In 1916 he was awarded the Hector Memorial Medal. In 1919 he returned to the 

Cavendish as Director. Under him, Nobel Prizes were awarded to Chadwick for discovering the neutron (in 1932), 

Cockcroft and Walton for an experiment which was to be known as splitting the atom using a particle accelerator, 

and Appleton for demonstrating the existence of the ionosphere. He was admitted to the Order of Merit in 1925 and 

T71 

raised to the peerage as Baron Rutherford of Nelson, of Cambridge in the County of Cambridge, in 1931, a title 
that became extinct upon his unexpected death in hospital following an operation for an umbilical hernia (1937). 
Since he was a peer, British protocol at that time required that he be operated on by a titled doctor, and the delay cost 
him his life. He is interred in Westminster Abbey, alongside J. J. Thomson, and near Sir Isaac Newton. 

Scientific research 

During the investigation of radioactivity he coined the terms alpha and beta in 1 899 to describe the two distinct types 
of radiation emitted by thorium and uranium. These rays were differentiated on the basis of penetrating power. From 
1900 to 1903 he was joined at McGill by the young Frederick Soddy (Nobel Prize in Chemistry, 1921) and they 
collaborated on research into the transmutation of elements. Rutherford had demonstrated that radioactivity was the 
spontaneous disintegration of atoms. He noticed that a sample of radioactive material invariably took the same 
amount of time for half the sample to decay — its "half-life" — and created a practical application using this constant 
rate of decay as a clock, which could then be used to help determine the age of the Earth, which turned out to be 
much older than most of the scientists at the time believed. 

In 1903, Rutherford realised that a type of radiation from radium discovered (but not named) by French chemist Paul 
Villard in 1900, must represent something different from alpha rays and beta rays, due to its very much greater 
penetrating power. Rutherford gave this third type of radiation its name also: the gamma ray. 
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In Manchester he continued to work with alpha radiation, in conjunction with Hans Geiger he developed zinc 
sulphide scintillation screens and ionisation chambers to count alphas. By dividing the total charge they produced by 
the number counted, Rutherford decided that the charge on the alpha was two. In late 1907 Ernest Rutherford and 
Thomas Royds allowed alphas to penetrate a very thin window into an evacuated tube. As they sparked the tube into 
discharge, the spectrum obtained from it changed, as the alphas were trapped. Eventually, the clear spectrum of 
helium gas appeared, proving that alphas were at least ionised helium atoms, and probably helium nuclei. 

Along with Hans Geiger and Ernest Marsden he carried out the Geiger— Marsden experiment in 1909, which 
demonstrated the nuclear nature of atoms. Rutherford was inspired to ask Geiger and Marsden in this experiment to 
look for alpha particles with very high deflection angles, of a type not expected from any theory of matter at that 
time. Such deflections, through rare, were found, and proved to be a smooth but high-order function of the deflection 
angle. It was Rutherford's interpretation of this data that led him to formulate the Rutherford model of the atom in 
1911 — that a very small positively charged nucleus was orbited by electrons. 

In Cambridge in 1919, after taking over the Cavendish laboratory, Rutherford became the first person to transmute 
one element into another when he converted nitrogen into oxygen through the nuclear reaction 14 N + a — > 17 0 + 
p. In 1921, while working with Niels Bohr (who postulated that electrons moved in specific orbits), Rutherford 
theorised about the existence of neutrons, which could somehow compensate for the repelling effect of the positive 
charges of protons by causing an attractive nuclear force and thus keeping the nuclei from breaking apart. 
Rutherford's theory of neutrons was proved in 1932 by his associate James Chadwick, who in 1935 was awarded the 
Nobel Prize in Physics for this discovery. 

Legacy 



Nuclear physics 

Rutherford's research, and work done under him as laboratory director, 
established the nuclear structure of the atom and the essential nature of 
radioactive decay. Rutherford's team also demonstrated artificially 
induced nuclear transmutation. He is known as the father of nuclear 
physics. Rutherford died too early to see Leo Szilard's idea of 
controlled nuclear chain reactions come into being. A speech of 
Rutherford's printed in the September 12, 1933 London paper The 
Times is reported by Szilard to have been his inspiration for thinking a plaque commemorating Rutherford's presence 
of the possiblity of a controlled nuclear chain reaction, in London, on at the Victoria University, Manchester 

the same day. Rutherford's speech, in part, read: 

We might in these processes obtain very much more energy than the proton supplied, but on the average we 
could not expect to obtain energy in this way. It was a very poor and inefficient way of producing energy, and 
anyone who looked for a source of power in the transformation of the atoms was talking moonshine. But the 
subject was scientifically interesting because it gave insight into the atomsP^ 
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Items named in honour of Rutherford's life and work 

Scientific discoveries 

• the element rutherfordium, Rf, Z=104. (1997) [10] 
Institutions 

• Rutherford Appleton Laboratory, a scientific research laboratory near Abingdon, Oxfordshire, UK. 

• Rutherford College, a school in Auckland, New Zealand 

• Rutherford College, a college at the University of Kent in Canterbury, UK 

• the Rutherford Institute for Innovation at the University of Cambridge, UK 

• Rutherford Intermediate School, Wanganui, New Zealand 

Buildings 

• a building of the modern Cavendish Laboratory at the University of Cambridge, UK 

• The Ernest Rutherford Physics Building at McGill University, Montreal, Canada^ 1 ^ 

• the physics and chemistry building at the University of Canterbury, New Zealand 

• The Coupland Building at the University of Manchester where Rutherford worked was renamed The Rutherford 
Building in 2006. 

• The Rutherford lecture theatre in the Schuster Laboratory at the University of Manchester 
Major streets 

• Rutherford Close, a residential street in Abingdon, Oxfordshire, UK. 

• Lord Rutherford Road in Brightwater, New Zealand — his birthplace. 

• Rutherford Road in the biotech district of Carlsbad, California, USA. 

• Rutherford Street in Nelson, New Zealand. 

Other 

• The crater Rutherford on the Moon, and the crater Rutherford on Mars 

• The Rutherford Award at Thomas Carr College for excellence in VCE Chemistry, Australia 

• Image on New Zealand $100 note. 

• Rutherford was the subject of a play by Stuart Hoar. 

• On the side of the Mond Laboratory on the site of the original Cavendish Laboratory in Cambridge, there is an 
engraving in Rutherford's memory in the form of a crocodile, this being the nickname given to him by its 
commissioner, his colleague Peter Kapitza. The initials of the engraver, Eric Gill, are visible within the mouth. 

• The Rutherford Foundation, a charitable trust set up by the Royal Society of New Zealand to support research in 
science and technology J 

Publications 

• Radio-activity (1904), 2nd ed. (1905), ISBN 978-1-60355-058-1 

• Radioactive Transformations (1906), ISBN 978-1-60355-054-3 

• Radiations from Radioactive Substances (1919) 

• The Electrical Structure of Matter (1926) 

• The Artificial Transmutation of the Elements (1933) 

• The Newer Alchemy ( 1 937) 
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Famous statements 

• "The energy produced by the breaking down of the atom is a very poor kind of thing. Anyone who expects a 

ri3i 

source of power from the transformation of these atoms is talking moonshine. " — 1933 

"It was almost as if you fired a 15 inch shell into a piece of tissue paper and it came back and hit you." (describing 
the Geiger-Marsden experiment) 
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Born 



Albert Einstein in 1921 
14 March 1879Ulm, Kingdom of Wurttemberg, German Empire 



Died 

Resting place 



18 April 1955 (aged 76)Princeton, New Jersey, United States 
Grounds of the Institute for Advanced Study, Princeton, New Jersey. 



Residence 
Ethnicity 



Germany, Italy, Switzerland, United States 
Jewish 



Citizenship 



Wiirttemberg/Germany (until 1896) 
Stateless (1896-1901) 
Switzerland (from 1901) 
Austria (1911-12) 
Germany (1914-33) 



United States (from 1940) 



[1] 



Alma mater 



Known for 



ETH Zurich 
University of Zurich 

General relativity and special relativity 
Photoelectric effect 
Mass-energy equivalence 
Quantification of the Brownian motion 
Einstein field equations 
Bose-Einstein statistics 
Unified Field Theory 



Spouse 



Awards 



Mileva Marie (1903-1919) 

Elsa Lowenthal, nee Einstein, (1919-1936) 

Nobel Prize in Physics (1921) 
Copley Medal (1925) 
Max Planck Medal (1929) 
Time Person of the Century 
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Albert Einstein (pronounced /'selb9rt 'ainstain/; German: ['albBt 'ainjtain] ( 4)) listen); 14 March 1879- 18 April 
1955) was a theoretical physicist, philosopher and author who is widely regarded as one of the most influential and 
iconic scientists and intellectuals of all time. A German-Swiss Nobel laureate, Einstein is often regarded as the father 
of modern physics. He received the 1921 Nobel Prize in Physics "for his services to theoretical physics, and 
especially for his discovery of the law of the photoelectric effect". 

Near the beginning of his career, Einstein thought that Newtonian mechanics was no longer enough to reconcile the 

laws of classical mechanics with the laws of the electromagnetic field. This led to the development of his special 

theory of relativity. He realized, however, that the principle of relativity could also be extended to gravitational 

fields, and with his subsequent theory of gravitation in 1916, he published a paper on the general theory of relativity. 

He continued to deal with problems of statistical mechanics and quantum theory, which led to his explanations of 

particle theory and the motion of molecules. He also investigated the thermal properties of light which laid the 

foundation of the photon theory of light. In 1917, Einstein applied the general theory of relativity to model the 

Mi 

structure of the universe as a whole. 

On the eve of World War II in 1939, he personally alerted President Franklin D. Roosevelt that Germany might be 
developing an atomic weapon, and recommended that the U.S. begin uranium procurement and nuclear research. As 
a result, Roosevelt advocated such research, leading to the creation of the top secret Manhattan Project, and the U.S. 
becoming the first and only country to possess nuclear weapons during the war. Days before his death, however, 
Einstein signed the Russell-Einstein Manifesto, that highlighted the dangers posed by the military usage of nuclear 
energy. 

Einstein published more than 300 scientific along with over 150 non-scientific works, and received honorary 

Mi 

doctorate degrees in science, medicine and philosophy from many European and American universities; he also 
wrote about various philosophical and political subjects such as socialism, international relations and the existence of 
God.^ His great intelligence and originality has made the word "Einstein" synonymous with genius ^ 

Biography 

Early life and education 
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Albert Einstein was born in Ulm, in the Kingdom of Wurttemberg in the German 
Empire on 14 March 1879. His father was Hermann Einstein, a salesman and 
engineer. His mother was Pauline Einstein (nee Koch). In 1880, the family 
moved to Munich, where his father and his uncle founded Elektrotechnische 
Fabrik J. Einstein & Cie, a company that manufactured electrical equipment 



based on direct current 



[V] 



The Einsteins were non-observant Jews. Their son attended a Catholic 
elementary school from the age of five until ten. 1 Although Einstein had early 
speech difficulties, he was a top student in elementary school ^ 

His father once showed him a pocket compass; Einstein realized that there must 
be something causing the needle to move, despite the apparent "empty space" P ^ 
As he grew, Einstein built models and mechanical devices for fun and began to 
show a talent for mathematics. In 1889, Max Talmud (later changed to Max 
Talmey) introduced the ten-year old Einstein to key texts in science, mathematics 
and philosophy, including Immanuel Kant's Critique of Pure Reason and Euclid's 
Elements (which Einstein called the "holy little geometry book"). Talmud was 
a poor Jewish medical student from Poland. The Jewish community arranged for 
Talmud to take meals with the Einsteins each week on Thursdays for six years. 
During this time Talmud wholeheartedly guided Einstein through many secular 
educational interests. 11 3] tl4] 




Albert Einstein in 1893 (age 14). 



In 1894, his father's company failed: direct current (DC) lost the War of Currents to alternating current (AC). In 
search of business, the Einstein family moved to Italy, first to Milan and then, a few months later, to Pavia. When the 
family moved to Pavia, Einstein stayed in Munich to finish his studies at the Luitpold Gymnasium. His father 
intended for him to pursue electrical engineering, but Einstein clashed with authorities and resented the school's 
regimen and teaching method. He later wrote that the spirit of learning and creative thought were lost in strict rote 
learning. In the spring of 1895, he withdrew to join his family in Pavia, convincing the school to let him go by using 
a doctor's note. 1 During this time, Einstein wrote his first scientific work, "The Investigation of the State of Aether 
in Magnetic Fields" J 15 ^ 

Einstein applied directly to the Eidgenossische Polytechnische Schule (ETH) in Zurich, Switzerland. Lacking the 

requisite Matura certificate, he took an entrance examination, which he failed, although he got exceptional marks in 

mathematics and physics J 16 ^ The Einsteins sent Albert to Aarau, in northern Switzerland to finish secondary 
T71 

school. While lodging with the family of Professor Jost Winteler, he fell in love with the family's daughter, Marie. 
(His sister Maja later married the Wintelers' son Paul.) In Aarau, Einstein studied Maxwell's electromagnetic 
theory. At age 17, he graduated, and, with his father's approval, renounced his citizenship in the German Kingdom of 
Wurttemberg to avoid military service, and in 1896 he enrolled in the four year mathematics and physics teaching 
diploma program at the Polytechnic in Zurich. Marie Winteler moved to Olsberg, Switzerland for a teaching post. 
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Einstein's future wife, Mileva Marie, also enrolled at the Polytechnic that same year, the only woman among the six 
students in the mathematics and physics section of the teaching diploma course. Over the next few years, Einstein 
and Marie's friendship developed into romance, and they read books together on extra-curricular physics in which 
Einstein was taking an increasing interest. In 1900 Einstein was awarded the Zurich Polytechnic teaching diploma, 

MOT 

but Marie failed the examination with a poor grade in the mathematics component, theory of functions. There 

have been claims that Marie collaborated with Einstein on his celebrated 1905 papers, ^ ^ but historians of 

r211 T221 T231 T241 

physics who have studied the issue find no evidence that she made any substantive contributions. 

Marriages and children 

In early 1902, Einstein and Mileva Marie had a daughter they named Lieserl in their correspondence, who was born 
in Novi Sad where Marie's parents lived. Her full name is not known, and her fate is uncertain after 1903. 

Einstein and Marie married in January 1903. In May 1904, the couple's first son, Hans Albert Einstein, was born in 
Bern, Switzerland. Their second son, Eduard, was born in Zurich in July 1910. In 1914, Einstein moved to Berlin, 
while his wife remained in Zurich with their sons. Marie and Einstein divorced on 14 February 1919, having lived 
apart for five years. 

Einstein married Elsa Lowenthal (nee Einstein) on 2 June 1919, after having had a relationship with her since 1912. 

She was his first cousin maternally and his second cousin paternally. In 1933, they emigrated permanently to the 

[27] 

United States. In 1935, Elsa Einstein was diagnosed with heart and kidney problems and died in December 1936. 



Patent office 




Left to right: Conrad Habicht, Maurice Solovine 
and Einstein, who founded the Olympia Academy 




After graduating, Einstein spent almost two frustrating years searching 
for a teaching post, but a former classmate's father helped him secure a 
job in Bern, at the Federal Office for Intellectual Property, the patent 
office, as an assistant examiner. 1 He evaluated patent applications 
for electromagnetic devices. In 1903, Einstein's position at the Swiss 
Patent Office became permanent, although he was passed over for 
promotion until he "fully mastered machine technology" P 9 ^ 

Much of his work at the patent office related to questions about 
transmission of electric signals and electrical-mechanical 
synchronization of time, two technical problems that show up 
conspicuously in the thought experiments that eventually led Einstein 
to his radical conclusions about the nature of light and the fundamental 
connection between space and timeJ 30 ^ 

With a few friends he met in Bern, Einstein started a small discussion 
group, self-mockingly named "The Olympia Academy", which met 
regularly to discuss science and philosophy. Their readings included 
the works of Henri Poincare, Ernst Mach, and David Hume, which 
influenced his scientific and philosophical outlook. 



Einstein's home in Bern 
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Academic career 

In 1901, Einstein had a paper on the capillary forces of a straw 
published in the prestigious Annalen der Physik. On 30 April 1905, 
he completed his thesis, with Alfred Kleiner, Professor of Experimental 
Physics, serving as pro-forma advisor. Einstein was awarded a PhD by 
the University of Zurich. His dissertation was entitled "A New 
Determination of Molecular Dimensions". That same year, which has 
been called Einstein's annus mirabilis or "miracle year", he published 
four groundbreaking papers, on the photoelectric effect, Brownian 
motion, special relativity, and the equivalence of matter and energy, 
which were to bring him to the notice of the academic world. 

By 1908, he was recognized as a leading scientist, and he was appointed 
lecturer at the University of Berne. The following year, he quit the 
patent office and the lectureship to take the position of physics 
docent at the University of Zurich. He became a full professor at 
Karl-Ferdinand University in Prague in 1911. In 1914, he returned to 
Germany after being appointed director of the Kaiser Wilhelm Institute 
for Physics (1914-1932) and a professor at the Humboldt University 
of Berlin, although with a special clause in his contract that freed him 
from most teaching obligations. He became a member of the Prussian Academy of Sciences. In 1916, Einstein was 
appointed president of the German Physical Society (1916-1918). [35] [36] 

In 1911, he had calculated that, based on his new theory of general relativity, light from another star would be bent 
by the Sun's gravity. That prediction was claimed confirmed by observations made by a British expedition led by Sir 
Arthur Eddington during the solar eclipse of May 29, 1919. International media reports of this made Einstein world 
famous. On 7 November 1919, the leading British newspaper The Times printed a banner headline that read: 
"Revolution in Science - New Theory of the Universe - Newtonian Ideas Overthrown". (Much later, questions 
were raised whether the measurements were accurate enough to support Einstein's theory.) 

In 1921, Einstein was awarded the Nobel Prize in Physics. Because relativity was still considered somewhat 
controversial, it was officially bestowed for his explanation of the photoelectric effect. He also received the Copley 
Medal from the Royal Society in 1925. 




Einstein's official 1921 portrait after receiving 
the Nobel Prize in Physics. 



Travels abroad 

Einstein visited New York City for the first time on 2 April 1921. When asked where he got his scientific ideas, 
Einstein explained that he believed scientific work best proceeds from an examination of physical reality and a 
search for underlying axioms, with consistent explanations that apply in all instances and avoid contradicting each 

T381 

other. He also recommended theories with visualizable results. (Einstein 1954) 

In 1922, he traveled throughout Asia and later to Palestine, as part of a six-month excursion and speaking tour. His 

travels included Singapore, Ceylon, and Japan, where he gave a series of lectures to thousands of Japanese. His first 

lecture in Tokyo lasted four hours, after which he met the emperor and empress at the Imperial Palace where 

T391 '307 

thousands came to watch. Einstein later gave his impressions of the Japanese in a letter to his sons: ' "Of all the 

T391 

people I have met, I like the Japanese most, as they are modest, intelligent, considerate, and have a feel for art.' 

:308 

On his return voyage, he also visited Palestine for twelve days in what would become his only visit to that region. 
"He was greeted with great British pomp, as if he were a head of state rather than a theoretical physicist", writes 
Isaacson. This included a cannon salute upon his arrival at the residence of the British high commissioner, Sir 
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Herbert Samuel. During one reception given to him, the building was "stormed by throngs who wanted to hear him". 
In Einstein's talk to the audience, he expressed his happiness over the event: 

I consider this the greatest day of my life. Before, I have always found something to regret in the Jewish 
soul, and that is the forgetfulness of its own people. Today, I have been made happy by the sight of the 
Jewish people learning to recognize themselves and to make themselves recognized as a force in the 
world. [40]:308 



Emigration from Germany 

In 1933, Einstein was compelled to immigrate to the United States due 
to the rise to power of the Nazis under Germany's new chancellor, 
Adolf HitlerJ 41 ^ While visiting American universities in April, 1933, 
he learned that the new German government had passed a law barring 
Jews from holding any official positions, including teaching at 
universities. A month later, the Nazi book burnings occurred, with 
Einstein's works being among those burnt, and Nazi propaganda 
minister Joseph Goebbels proclaimed, "Jewish intellectualism is 
dead." t40] Einstein also learned that his name was on a list of 
assassination targets, with a "$5,000 bounty on his head". One German 
magazine included him in a list of enemies of the German regime with 
the phrase, "not yet hanged" J 40 ^ ^ 




Next to Oliver Locker-Lampson, being protected 
in Norfolk, England, after escaping Nazi 
Germany in 1933 



Among other German scientists forced to flee were fourteen Nobel 
laureates and twenty-six of the sixty professors of theoretical physics 
in the country. Among the other scientists who left Germany, or the 

other countries it came to dominate, were Edward Teller, Niels Bohr, Enrico Fermi, Otto Stern, Victor Weisskopf, 
Hans Bethe, and Lise Meitner, many of whom made certain that the Allies would develop nuclear weapons first, 
before the Nazis J 40 ^ With so many other Jewish scientists now forced by circumstances to live in America, often 
working side by side, Einstein wrote to a friend, "For me the most beautiful thing is to be in contact with a few fine 
Jews — a few millennia of a civilized past do mean something after all." In another letter he writes, "In my whole life 
I have never felt so Jewish as now."'" 40 -' 

T431 

He took up a position at the Institute for Advanced Study at Princeton, New Jersey, an affiliation that lasted until 
his death in 1955. There, he tried unsuccessfully to develop a unified field theory and to refute the accepted 
interpretation of quantum physics. He and Kurt Godel, another Institute member, became close friends. They would 
take long walks together discussing their work. His last assistant was Bruria Kaufman, who later became a renowned 
physicist. 

World War II and the Manhattan Project 

In the summer of 1939, a few months before the beginning of World War II, Einstein was persuaded to write a letter 
to President Franklin D. Roosevelt and warn him that Nazi Germany might be developing an atomic bomb. The 
letter, written with the help of Hungarian emigre physicist Leo Szilard, gave the letter more prestige, with Einstein 
also recommending that the U.S. begin uranium enrichment and nuclear research. According to F.G. Gosling of the 
U.S. Department of Energy, Einstein, Szilard, and other refugees including Edward Teller and Eugene Wigner, 
"regarded it as their responsibility to alert Americans to the possibility that German scientists might win the race to 
build an atomic bomb, and to warn that Hitler would be more than willing to resort to such a weapon. " t44] 

British columnist Ambrose Evans-Pritchard notes, however, that Washington at first "brushed off with disbelief" the 
fears they expressed. He then describes how quickly Roosevelt changed his mind: 
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Albert Einstein interceded through the Belgian queen mother, eventually getting a personal envoy into 
the Oval Office. Roosevelt initially fobbed him off. He listened more closely at a second meeting over 
breakfast the next day, then made up his mind within minutes. "This needs action," he told his military 
aide. It was the birth of the Manhattan Project.'- 45 -' 

Gosling adds that "the President was a man of considerable action once he had chosen a direction," and believed that 
the U.S. "could not take the risk of allowing Hitler" to possess nuclear bombs. ^ Other weapons historians agree 
that the letter was "arguably the key stimulus for the U.S. adoption of serious investigations into nuclear weapons on 
the eve of the U.S. entry into World War II". As a result of Einstein's letter, and his meetings with Roosevelt, the 
U.S. entered the "race" to develop the bomb first, drawing on its "immense material, financial, and scientific 
resources". It became the only country to develop an atomic bomb during World War II as a result of its Manhattan 
Project J 46 ^ Einstein said to his old friend, Linus Pauling, in 1954, the last year of his life: "I made one great mistake 
in my life — when I signed the letter to President Roosevelt recommending that atom bombs be made; but there was 
some justification — the danger that the Germans would make them..."^ 47 ' 

U.S. citizenship 

Einstein became an American citizen in 1940. Not long after settling 
into his career at Princeton, he expressed his appreciation of the 
"meritocracy" in American culture when compared to Europe. 
According to Isaacson, he recognized the "right of individuals to say 
and think what they pleased", without social barriers, and as result, the 
individual was "encouraged" to be more creative, a trait he valued from 
his own early education. Einstein writes: 

What makes the new arrival devoted to this country is the 
democratic trait among the people. No one humbles himself 
before another person or class. . . American youth has the good 

fortune not to have its outlook troubled by outworn traditions. ^ 

:432 




As a member of the NAACP at Princeton who campaigned for the civil 
rights of African Americans, Einstein corresponded with civil rights 
activist W. E. B. Du Bois, and in 1946 Einstein called racism 
America's "worst disease" J 48] He later stated, "Race prejudice has 
unfortunately become an American tradition which is uncritically 
handed down from one generation to the next. The only remedies are 
enlightenment and education" J 49 ^ 

After the death of Israel's first president, Chaim Weizmann, in 

November 1952, Prime Minister David Ben-Gurion offered Einstein 

the position of President of Israel, a mostly ceremonial postJ 50 ^ The 

offer was presented by Israel's ambassador in Washington, Abba Eban, 

who explained that the offer "embodies the deepest respect which the 

T391 '522 

Jewish people can repose in any of its sons". ' However, Einstein 
declined, and wrote in his response that he was "deeply moved", and 
"at once saddened and ashamed" that he could not accept it: 




Einstein with David Ben Gurion, 1951 
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All my life I have dealt with objective matters, hence I lack both the natural aptitude and the experience to 
deal properly with people and to exercise official function. I am the more more distressed over these 
circumstances because my relationship with the Jewish people became my strongest human tie once I achieved 
complete clarity about our precarious position among the nations of the worldP 9 ^ 522 ^ '- 51 -' 



Death 

On April 17, 1955, Albert Einstein experienced internal bleeding 
caused by the rupture of an abdominal aortic aneurysm, which had 
previously been reinforced surgically by Dr. Rudolph Nissen in 
1948. He took the draft of a speech he was preparing for a 
television appearance commemorating the State of Israel's seventh 
anniversary with him to the hospital, but he did not live long enough to 
complete it. Einstein refused surgery, saying: "I want to go when I 
want. It is tasteless to prolong life artificially. I have done my share, it 
is time to go. I will do it elegantly.' He died in Princeton Hospital 
early the next morning at the age of 76, having continued to work until 
near the end. 
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DR. EINSTEIN IS DEAD AT 76 




The New York World-Telegram announces 
Einstein's death on April 18, 1955. 



Einstein's remains were cremated and his ashes were scattered around the grounds of the Institute for Advanced 
Study P 5 ^ t56] During the autopsy, the pathologist of Princeton Hospital, Thomas Stoltz Harvey, removed Einstein's 
brain for preservation, without the permission of his family, in hope that the neuroscience of the future would be able 
to discover what made Einstein so intelligent. 1 



Scientific career 

Throughout his life, Einstein published hundreds of books and articles. 
Most were about physics, but a few expressed leftist political opinions 
about pacifism, socialism, and zionism.^ ^ In addition to the work he 
did by himself he also collaborated with other scientists on additional 
projects including the Bose-Einstein statistics, the Einstein refrigerator 
and others. [58] 

Physics in 1900 

Einstein's early papers all come from attempts to demonstrate that 
atoms exist and have a finite nonzero size. At the time of his first paper 
in 1902, it was not yet completely accepted by physicists that atoms 
were real, even though chemists had good evidence ever since Antoine 
Lavoisier's work a century earlier. The reason physicists were skeptical 
was because no 19th century theory could fully explain the properties 
of matter from the properties of atoms. 




Albert Einstein in 1904. 



Ludwig Boltzmann was a leading 19th century atomist physicist, who 

had struggled for years to gain acceptance for atoms. Boltzmann had given an interpretation of the laws of 
thermodynamics, suggesting that the law of entropy increase is statistical. In Boltzmann's way of thinking, the 
entropy is the logarithm of the number of ways a system could be configured inside. The reason the entropy goes up 

is only because it is more likely for a system to go from a special state with only a few possible internal 
configurations to a more generic state with many. While Boltzmann's statistical interpretation of entropy is 
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universally accepted today, and Einstein believed it, at the turn of the 20th century it was a minority position. 

The statistical idea was most successful in explaining the properties of gases. James Clerk Maxwell, another leading 
atomist, had found the distribution of velocities of atoms in a gas, and derived the surprising result that the viscosity 
of a gas should be independent of density. Intuitively, the friction in a gas would seem to go to zero as the density 
goes to zero, but this is not so, because the mean free path of atoms becomes large at low densities. A subsequent 
experiment by Maxwell and his wife confirmed this surprising prediction. Other experiments on gases and vacuum, 
using a rotating slitted drum, showed that atoms in a gas had velocities distributed according to Maxwell's 
distribution law. 

In addition to these successes, there were also inconsistencies. Maxwell noted that at cold temperatures, atomic 
theory predicted specific heats that are too large. In classical statistical mechanics, every spring-like motion has 
thermal energy kT on average at temperature T, so that the specific heat of every spring is Boltzmann's constant & . 
A monatomic solid with N atoms can be thought of as N little balls representing N atoms attached to each other in a 
box grid with 3N springs, so the specific heat of every solid is 3Nk , a result which became known as the 
Dulong-Petit law. This law is true at room temperature, but not for colder temperatures. At temperatures near zero, 
the specific heat goes to zero. 

Similarly, a gas made up of a molecule with two atoms can be thought of as two balls on a spring. This spring has 
energy k^T at high temperatures, and should contribute an extra & B to the specific heat. It does at temperatures of 
about 1000 degrees, but at lower temperature, this contribution disappears. At zero temperature, all other 
contributions to the specific heat from rotations and vibrations also disappear. This behavior was inconsistent with 
classical physics. 

The most glaring inconsistency was in the theory of light waves. Continuous waves in a box can be thought of as 
infinitely many spring-like motions, one for each possible standing wave. Each standing wave has a specific heat of 
Al,, so the total specific heat of a continuous wave like light should be infinite in classical mechanics. This is 
obviously wrong, because it would mean that all energy in the universe would be instantly sucked up into light 
waves, and everything would slow down and stop. 

These inconsistencies led some people to say that atoms were not physical, but mathematical. Notable among the 
skeptics was Ernst Mach, whose positivist philosophy led him to demand that if atoms are real, it should be possible 
to see them directly P 9 ^ Mach believed that atoms were a useful fiction, that in reality they could be assumed to be 
infinitesimally small, that Avogadro's number was infinite, or so large that it might as well be infinite, and was 
infinitesimally small. Certain experiments could then be explained by atomic theory, but other experiments could 
not, and this is the way it will always be. 

Einstein opposed this position. Throughout his career, he was a realist. He believed that a single consistent theory 
should explain all observations, and that this theory would be a description of what was really going on, underneath 
it all. So he set out to show that the atomic point of view was correct. This led him first to thermodynamics, then to 
statistical physics, and to the theory of specific heats of solids. 

In 1905, while he was working in the patent office, the leading German language physics journal Annalen der Physik 
published four of Einstein's papers. The four papers eventually were recognized as revolutionary, and 1905 became 
known as Einstein's "Miracle Year", and the papers as the Annus Mirabilis Papers. 
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Thermodynamic fluctuations and statistical physics 

Einstein's earliest papers were concerned with thermodynamics. He wrote a paper establishing a thermodynamic 
identity in 1902, and a few other papers which attempted to interpret phenomena from a statistical atomic point of 
view. 

His research in 1903 and 1904 was mainly concerned with the effect of finite atomic size on diffusion phenomena. 
As in Maxwell's work, the finite nonzero size of atoms leads to effects which can be observed. This research, and the 
thermodynamic identity, were well within the mainstream of physics in his time. They would eventually form the 
content of his PhD thesis. ^ 

His first major result in this field was the theory of thermodynamic fluctuations. When in equilibrium, a system has a 
maximum entropy and, according to the statistical interpretation, it can fluctuate a little bit. Einstein pointed out that 
the statistical fluctuations of a macroscopic object, like a mirror suspended on spring, would be completely 
determined by the second derivative of the entropy with respect to the position of the mirror. 

Searching for ways to test this relation, his great breakthrough came in 1905. The theory of fluctuations, he realized, 
would have a visible effect for an object which could move around freely. Such an object would have a velocity 
which is random, and would move around randomly, just like an individual atom. The average kinetic energy of the 
object would be kgT, and the time decay of the fluctuations would be entirely determined by the law of friction. 

The law of friction for a small ball in a viscous fluid like water was discovered by George Stokes. He showed that 
for small velocities, the friction force would be proportional to the velocity, and to the radius of the particle (see 
Stokes' law). This relation could be used to calculate how far a small ball in water would travel due to its random 
thermal motion, and Einstein noted that such a ball, of size about a micrometre, would travel about a few 
micrometres per second. This motion could be easily detected with a microscope and indeed, as Brownian motion, 
had actually been observed by the botanist Robert Brown. Einstein was able to identify this motion with that 
predicted by his theory. Since the fluctuations which give rise to Brownian motion are just the same as the 
fluctuations of the velocities of atoms, measuring the precise amount of Brownian motion using Einstein's theory 
would show that Boltzmann's constant is non-zero and would measure Avogadro's number. 

These experiments were carried out a few years later by Jean Baptiste Perrin, and gave a rough estimate of 
Avogadro's number consistent with the more accurate estimates due to Max Planck's theory of blackbody light and 
Robert Millikan's measurement of the charge of the electron J 61 ^ Unlike the other methods, Einstein's required very 
few theoretical assumptions or new physics, since it was directly measuring atomic motion on visible grains. 

Einstein's theory of Brownian motion was the first paper in the field of statistical physics. It established that 
thermodynamic fluctuations were related to dissipation. This was shown by Einstein to be true for time-independent 
fluctuations, but in the Brownian motion paper he showed that dynamical relaxation rates calculated from classical 
mechanics could be used as statistical relaxation rates to derive dynamical diffusion laws. These relations are known 
as Einstein relations. 

The theory of Brownian motion was the least revolutionary of Einstein's Annus mirabilis papers, but it is the most 
frequently cited, and had an important role in securing the acceptance of the atomic theory by physicists. 
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Thought experiments and a-priori physical principles 

Einstein's thinking underwent a transformation in 1905. He had come to understand that quantum properties of light 
mean that Maxwell's equations were only an approximation. He knew that new laws would have to replace these, but 
he did not know how to go about finding those laws. He felt that guessing formal relations would not go anywhere. 

So he decided to focus on a-priori principles instead, which are statements about physical laws which can be 
understood to hold in a very broad sense even in domains where they have not yet been shown to apply. A well 
accepted example of an a-priori principle is rotational invariance. If a new force is discovered in physics, it is 
assumed to be rotationally invariant almost automatically, without thought. Einstein sought new principles of this 
sort, to guide the production of physical ideas. Once enough principles are found, then the new physics will be the 
simplest theory consistent with the principles and with previously known laws. 

The first general a-priori principle he found was the principle of relativity, that uniform motion is indistinguishable 
from rest. This was understood by Hermann Minkowski to be a generalization of rotational invariance from space to 
space-time. Other principles postulated by Einstein and later vindicated are the principle of equivalence and the 
principle of adiabatic invariance of the quantum number. Another of Einstein's general principles, Mach's principle, 
is fiercely debated, and whether it holds in our world or not is still not definitively established. 

The use of a-priori principles is a distinctive unique signature of Einstein's early work, and has become a standard 
tool in modern theoretical physics. 

Special relativity 

His 1905 paper on the electrodynamics of moving bodies introduced his theory of special relativity, which showed 
that the observed independence of the speed of light on the observer's state of motion required fundamental changes 
to the notion of simultaneity. Consequences of this include the time- space frame of a moving body slowing down 
and contracting (in the direction of motion) relative to the frame of the observer. This paper also argued that the idea 
of a luminiferous aether - one of the leading theoretical entities in physics at the time - was superfluous J 62 ^ In his 
paper on mass-energy equivalence, which had previously been considered to be distinct concepts, Einstein deduced 
from his equations of special relativity what has been called the 20th century's best-known equation: E = mc 2 } 63 ^ t64] 
This equation suggests that tiny amounts of mass could be converted into huge amounts of energy and presaged the 
development of nuclear power. ^ Einstein's 1905 work on relativity remained controversial for many years, but was 
accepted by leading physicists, starting with Max Planck. ^ ^ 

Photons 

In a 1905 paper, ^ Einstein postulated that light itself consists of localized particles (quanta). Einstein's light quanta 
were nearly universally rejected by all physicists, including Max Planck and Niels Bohr. This idea only became 
universally accepted in 1919, with Robert Millikan's detailed experiments on the photoelectric effect, and with the 
measurement of Compton scattering. 

Einstein's paper on the light particles was almost entirely motivated by thermodynamic considerations. He was not at 
all motivated by the detailed experiments on the photoelectric effect, which did not confirm his theory until fifteen 
years later. Einstein considers the entropy of light at temperature T, and decomposes it into a low-frequency part and 
a high-frequency part. The high-frequency part, where the light is described by Wien's law, has an entropy which 
looks exactly the same as the entropy of a gas of classical particles. 

Since the entropy is the logarithm of the number of possible states, Einstein concludes that the number of states of 
short wavelength light waves in a box with volume V is equal to the number of states of a group of localizable 
particles in the same box. Since (unlike others) he was comfortable with the statistical interpretation, he confidently 
postulates that the light itself is made up of localized particles, as this is the only reasonable interpretation of the 
entropy. 
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This leads him to conclude that each wave of frequency / is associated with a collection of photons with energy hf 
each, where h is Planck's constant. He does not say much more, because he is not sure how the particles are related to 
the wave. But he does suggest that this idea would explain certain experimental results, notably the photoelectric 
effect. 1691 

Quantized atomic vibrations 

Einstein continued his work on quantum mechanics in 1906, by explaining the specific heat anomaly in solids. This 
was the first application of quantum theory to a mechanical system. Since Planck's distribution for light oscillators 
had no problem with infinite specific heats, the same idea could be applied to solids to fix the specific heat problem 
there. Einstein showed in a simple model that the hypothesis that solid motion is quantized explains why the specific 
heat of a solid goes to zero at zero temperature. 

Einstein's model treats each atom as connected to a single spring. Instead of connecting all the atoms to each other, 
which leads to standing waves with all sorts of different frequencies, Einstein imagined that each atom was attached 
to a fixed point in space by a spring. This is not physically correct, but it still predicts that the specific heat is 3Nk, 
since the number of independent oscillations stays the same. 

Einstein then assumes that the motion in this model is quantized, according to the Planck law, so that each 
independent spring motion has energy which is an integer multiple of hf, where f is the frequency of oscillation. 
With this assumption, he applied Boltzmann's statistical method to calculate the average energy of the spring. The 
result was the same as the one that Planck had derived for light: for temperatures where k^Tis much smaller than hf 
the motion is frozen, and the specific heat goes to zero. 

So Einstein concluded that quantum mechanics would solve the main problem of classical physics, the specific heat 
anomaly. The particles of sound implied by this formulation are now called phonons. Because all of Einstein's 
springs have the same stiffness, they all freeze out at the same temperature, and this leads to a prediction that the 
specific heat should go to zero exponentially fast when the temperature is low. The solution to this problem is to 
solve for the independent normal modes individually, and to quantize those. Then each normal mode has a different 
frequency, and long wavelength vibration modes freeze out at colder temperatures than short wavelength ones. This 
was done by Peter Debye, and after this modification Einstein's quantization method reproduced quantitatively the 
behavior of the specific heats of solids at low temperatures. 

This work was the foundation of condensed matter physics. 

Adiabatic principle and action-angle variables 

Throughout the 1910s, quantum mechanics expanded in scope to cover many different systems. After Ernest 
Rutherford discovered the nucleus and proposed that electrons orbit like planets, Niels Bohr was able to show that 
the same quantum mechanical postulates introduced by Planck and developed by Einstein would explain the discrete 
motion of electrons in atoms, and the periodic table of the elements. 

Einstein contributed to these developments by linking them with the 1898 arguments Wilhelm Wien had made. Wien 
had shown that the hypothesis of adiabatic invariance of a thermal equilibrium state allows all the blackbody curves 
at different temperature to be derived from one another by a simple shifting process. Einstein noted in 1911 that the 
same adiabatic principle shows that the quantity which is quantized in any mechanical motion must be an adiabatic 
invariant. Arnold Sommerfeld identified this adiabatic invariant as the action variable of classical mechanics. The 
law that the action variable is quantized was the basic principle of the quantum theory as it was known between 1900 
and 1925. 
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Wave-particle duality 

Although the patent office promoted Einstein to Technical Examiner Second Class in 1906, he had not given up on 
academia. In 1908, he became a privatdozent at the University of BernJ 70] In "iiber die Entwicklung unserer 
Anschauungen iiber das Wesen und die Konstitution der Strahlung" ("The Development of Our Views on the 
Composition and Essence of Radiation"), on the quantization of light, and in an earlier 1909 paper, Einstein showed 
that Max Planck's energy quanta must have well-defined momenta and act in some respects as independent, 
point-like particles. This paper introduced the photon concept (although the name photon was introduced later by 
Gilbert N. Lewis in 1926) and inspired the notion of wave-particle duality in quantum mechanics. 

Theory of critical opalescence 

Einstein returned to the problem of thermodynamic fluctuations, giving a treatment of the density variations in a 
fluid at its critical point. Ordinarily the density fluctuations are controlled by the second derivative of the free energy 
with respect to the density. At the critical point, this derivative is zero, leading to large fluctuations. The effect of 
density fluctuations is that light of all wavelengths is scattered, making the fluid look milky white. Einstein relates 
this to Raleigh scattering, which is what happens when the fluctuation size is much smaller than the wavelength, and 
which explains why the sky is blue. 



Zero-point energy 

Einstein's physical intuition led him to note that Planck's oscillator 
energies had an incorrect zero point. He modified Planck's hypothesis 
by stating that the lowest energy state of an oscillator is equal to V^/, to 
half the energy spacing between levels. This argument, which was made 
in 1913 in collaboration with Otto Stern, was based on the 
thermodynamics of a diatomic molecule which can split apart into two 
free atoms. 

Principle of equivalence 

In 1907, while still working at the patent office, Einstein had what he 
would call his "happiest thought". He realized that the principle of 
relativity could be extended to gravitational fields. He thought about the 
case of a uniformly accelerated box not in a gravitational field, and 
noted that it would be indistinguishable from a box sitting still in an 
unchanging gravitational field. He used special relativity to see that 
the rate of clocks at the top of a box accelerating upward would be 
faster than the rate of clocks at the bottom. He concludes that the rates 
of clocks depend on their position in a gravitational field, and that the 
difference in rate is proportional to the gravitational potential to first 
approximation. 

Although this approximation is crude, it allowed him to calculate the deflection of light by gravity, and show that it 
is nonzero. This gave him confidence that the scalar theory of gravity proposed by Gunnar Nordstrom was incorrect. 
But the actual value for the deflection that he calculated was too small by a factor of two, because the approximation 
he used doesn't work well for things moving at near the speed of light. When Einstein finished the full theory of 
general relativity, he would rectify this error and predict the correct amount of light deflection by the sun. 

From Prague, Einstein published a paper about the effects of gravity on light, specifically the gravitational redshift 
and the gravitational deflection of light. The paper challenged astronomers to detect the deflection during a solar 
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eclipse. German astronomer Erwin Finlay-Freundlich publicized Einstein's challenge to scientists around the 
world. [74] 

Einstein thought about the nature of the gravitational field in the years 1909-1912, studying its properties by means 
of simple thought experiments. A notable one is the rotating disk. Einstein imagined an observer making 
experiments on a rotating turntable. He noted that such an observer would find a different value for the mathematical 
constant pi than the one predicted by Euclidean geometry. The reason is that the radius of a circle would be 
measured with an uncontracted ruler, but, according to special relativity, the circumference would seem to be longer 
because the ruler would be contracted. 

Since Einstein believed that the laws of physics were local, described by local fields, he concluded from this that 
spacetime could be locally curved. This led him to study Riemannian geometry, and to formulate general relativity in 
this language. 

Hole argument and Entwurf theory 

While developing general relativity, Einstein became confused about the gauge in variance in the theory. He 
formulated an argument that led him to conclude that a general relativistic field theory is impossible. He gave up 
looking for fully generally co variant tensor equations, and searched for equations that would be invariant under 
general linear transformations only. 

In June, 1913 the Entwurf ("draft") theory was the result of these investigations. As its name suggests, it was a 
sketch of a theory, with the equations of motion supplemented by additional gauge fixing conditions. Simultaneously 
less elegant and more difficult than general relativity, after more than two years of intensive work Einstein 
abandoned the theory in November, 1915 after realizing that the hole argument was mistaken. 

General relativity 

In 1912, Einstein returned to Switzerland to accept a professorship at his alma mater, the ETH. Once back in Zurich, 
he immediately visited his old ETH classmate Marcel Grossmann, now a professor of mathematics, who introduced 
him to Riemannian geometry and, more generally, to differential geometry. On the recommendation of Italian 
mathematician Tullio Levi-Civita, Einstein began exploring the usefulness of general covariance (essentially the use 
of tensors) for his gravitational theory. For a while Einstein thought that there were problems with the approach, but 
he later returned to it and, by late 1915, had published his general theory of relativity in the form in which it is used 
today F 6 ^ This theory explains gravitation as distortion of the structure of spacetime by matter, affecting the inertial 
motion of other matter. During World War I, the work of Central Powers scientists was available only to Central 
Powers academics, for national security reasons. Some of Einstein's work did reach the United Kingdom and the 
United States through the efforts of the Austrian Paul Ehrenfest and physicists in the Netherlands, especially 1902 
Nobel Prize-winner Hendrik Lorentz and Willem de Sitter of Leiden University. After the war ended, Einstein 
maintained his relationship with Leiden University, accepting a contract as an Extraordinary Professor, for ten 
years, from 1920 to 1930, he travelled to Holland regularly to lecture. 

In 1917, several astronomers accepted Einstein 's 1911 challenge from Prague. The Mount Wilson Observatory in 
California, U.S., published a solar spectroscopic analysis that showed no gravitational redshift.' 78 ^ In 1918, the Lick 
Observatory, also in California, announced that it too had disproved Einstein's prediction, although its findings were 
not published. [79] 
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Eddington's photograph of a solar eclipse, which 
confirmed Einstein's theory that light "bends". 



movement. 



[83] [84] 



However, in May 1919, a team led by the British astronomer Arthur 
Stanley Eddington claimed to have confirmed Einstein's prediction of 
gravitational deflection of starlight by the Sun while photographing a 
solar eclipse with dual expeditions in Sobral, northern Brazil, and 
Principe, a west African island Nobel laureate Max Born praised 
general relativity as the "greatest feat of human thinking about 
nature" fellow laureate Paul Dirac was quoted saying it was 

roil 

"probably the greatest scientific discovery ever made". L ° 1J The 
international media guaranteed Einstein's global renown. 

There have been claims that scrutiny of the specific photographs taken 

on the Eddington expedition showed the experimental uncertainty to be 

comparable to the same magnitude as the effect Eddington claimed to 

have demonstrated, and that a 1962 British expedition concluded that 

[371 

the method was inherently unreliable. The deflection of light during 
a solar eclipse was confirmed by later, more accurate observations J 82 ^ 
Some resented the newcomer's fame, notably among some German 
physicists, who later started the Deutsche Physik (German Physics) 



Cosmology 

In 1917, Einstein applied the General theory of relativity to model the structure of the universe as a whole. He 
wanted the universe to be eternal and unchanging, but this type of universe is not consistent with relativity. To fix 
this, Einstein modified the general theory by introducing a new notion, the cosmological constant. With a positive 
cosmological constant, the universe could be an eternal static sphere^ 85 ^ 

Einstein believed a spherical static universe is philosophically preferred, because it would obey Mach's principle. He 
had shown that general relativity incorporates Mach's principle to a certain extent in frame dragging by 
gravitomagnetic fields, but he knew that Mach's idea would not work if space goes on forever. In a closed universe, 
he believed that Mach's principle would hold. 

Mach's principle has generated much controversy over the years. 
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Modern quantum theory 

In 1917, at the height of his work on relativity, Einstein published an 
article in Physikalische Zeitschrift that proposed the possibility of 
stimulated emission, the physical process that makes possible the 
maser and the laser J 86 ^ This article showed that the statistics of 
absorption and emission of light would only be consistent with 
Planck's distribution law if the emission of light into a mode with n 
photons would be enhanced statistically compared to the emission of 
light into an empty mode. This paper was enormously influential in the 
later development of quantum mechanics, because it was the first paper 
to show that the statistics of atomic transitions had simple laws. 
Einstein discovered Louis de Broglie's work, and supported his ideas, 
which were received skeptically at first. In another major paper from 
this era, Einstein gave a wave equation for de Broglie waves, which 
Einstein suggested was the Hamilton-Jacobi equation of mechanics. 
This paper would inspire Schrodinger's work of 1926. 




Einstein in his office at the University of Berlin. 



Bose-Einstein statistics 

In 1924, Einstein received a description of a statistical model from Indian physicist Satyendra Nath Bose, based on a 
counting method that assumed that light could be understood as a gas of indistinguishable particles. Einstein noted 
that Bose's statistics applied to some atoms as well as to the proposed light particles, and submitted his translation of 
Bose's paper to the Zeitschrift filr Physik. Einstein also published his own articles describing the model and its 
implications, among them the Bose-Einstein condensate phenomenon that some particulates should appear at very 

TR71 

low temperatures. It was not until 1995 that the first such condensate was produced experimentally by Eric Allin 
Cornell and Carl Wieman using ultra-cooling equipment built at the NIST-JILA laboratory at the University of 
Colorado at Boulder J 88 ^ Bose-Einstein statistics are now used to describe the behaviors of any assembly of bosons. 
Einstein's sketches for this project may be seen in the Einstein Archive in the library of the Leiden University.^ 

Energy momentum pseudotensor 

General relativity includes a dynamical spacetime, so it is difficult to see how to identify the conserved energy and 
momentum. Noether's theorem allows these quantities to be determined from a Lagrangian with translation 
invariance, but general covariance makes translation invariance into something of a gauge symmetry. The energy 
and momentum derived within general relativity by Noether's presecriptions do not make a real tensor for this 
reason. 

Einstein argued that this is true for fundamental reasons, because the gravitational field could be made to vanish by a 
choice of coordinates. He maintained that the non-covariant energy momentum pseudotensor was in fact the best 
description of the energy momentum distribution in a gravitational field. This approach has been echoed by Lev 
Landau and Evgeny Lifshitz, and others, and has become standard. 

The use of non-covariant objects like pseudotensors was heavily criticized in 1917 by Erwin Schrodinger and others. 
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Unified field theory 

Following his research on general relativity, Einstein entered into a series of attempts to generalize his geometric 
theory of gravitation, which would allow the explanation of electromagnetism. In 1950, he described his "unified 
field theory" in a Scientific American article entitled "On the Generalized Theory of Gravitation".^ Although he 
continued to be lauded for his work, Einstein became increasingly isolated in his research, and his efforts were 
ultimately unsuccessful. In his pursuit of a unification of the fundamental forces, Einstein ignored some mainstream 
developments in physics, most notably the strong and weak nuclear forces, which were not well understood until 
many years after his death. Mainstream physics, in turn, largely ignored Einstein's approaches to unification. 
Einstein's dream of unifying other laws of physics with gravity motivates modern quests for a theory of everything 
and in particular string theory, where geometrical fields emerge in a unified quantum-mechanical setting. 

Wormholes 

Einstein collaborated with others to produce a model of a wormhole. His motivation was to model elementary 
particles with charge as a solution of gravitational field equations, in line with the program outlined in the paper "Do 
Gravitational Fields play an Important Role in the Constitution of the Elementary Particles?". These solutions cut 
and pasted Schwarzschild black holes to make a bridge between two patches. 

If one end of a wormhole was positively charged, the other end would be negatively charged. These properties led 
Einstein to believe that pairs of particles and antiparticles could be described in this way. 

Einstein-Cartan theory 

In order to incorporate spinning point particles into general relativity, the affine connection needed to be generalized 
to include an antisymmetric part, called the torsion. This modification was made by Einstein and Cartan in the 1920s. 

Equations of motion 

The theory of general relativity has a fundamental law - the Einstein equations which describe how space curves, 
the geodesic equation which describes how particles move may be derived from the Einstein equations. 

Since the equations of general relativity are non-linear, a lump of energy made out of pure gravitational fields, like a 
black hole, would move on a trajectory which is determined by the Einstein equations themselves, not by a new law. 
So Einstein proposed that the path of a singular solution, like a black hole, would be determined to be a geodesic 
from general relativity itself. 

This was established by Einstein, Infeld and Hoffmann for pointlike objects without angular momentum, and by Roy 
Kerr for spinning objects. 

Einstein's controversial beliefs in physics 

In addition to his well-accepted results, some of Einstein's views are regarded as controversial: 

• In the special relativity paper (in 1905), Einstein noted that, given a specific definition of the word "force" (a 
definition which he later agreed was not advantageous), and if we choose to maintain (by convention) the 
equation mass x acceleration = force, then one arrives at m/(l-i; 2 /c 2 )as the expression for the transverse mass of 
a fast moving particle. This differs from the accepted expression today, because, as noted in the footnotes to 
Einstein's paper added in the 1913 reprint, "it is more to the point to define force in such a way that the laws of 
energy and momentum assume the simplest form", as was done, for example, by Max Planck in 1906, who gave 
the now familiar expression m/^/l-v 2 /c 2 for the transverse mass. As Miller points out, this is equivalent to the 
transverse mass predictions of both Einstein and Lorentz. Einstein had commented already in the 1905 paper that 
"With a different definition of force and acceleration, we should naturally obtain other expressions for the masses. 
This shows that in comparing different theories... we must proceed very cautiously." t90] 
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• Einstein published (in 1922) a qualitative theory of superconductivity based on the vague idea of electrons shared 
in orbits. This paper predated modern quantum mechanics, and today is regarded as being incorrect. The current 
theory of low temperature superconductivity was only worked out in 1957, thirty years after the establishing of 
modern quantum mechanics. However, even today, superconductivity is not well understood, and alternative 
theories continue to be put forward, especially to account for high- temperature superconductors. 

• After introducing the concept of gravitational waves in 1917, Einstein subsequently entertained doubts about 
whether they could be physically realized. In 1937 he published a paper saying that the focusing properties of 
geodesies in general relativity would lead to an instability which causes plane gravitational waves to collapse in 
on themselves. While this is true to a certain extent in some limits, because gravitational instabilities can lead to a 
concentration of energy density into black holes, for plane waves of the type Einstein and Rosen considered in 
their paper, the instabilities are under control. Einstein retracted this position a short time later. 

• Einstein denied several times that black holes could form. In 1939 he published a paper that argues that a star 
collapsing would spin faster and faster, spinning at the speed of light with infinite energy well before the point 
where it is about to collapse into a black hole. This paper received no citations, and the conclusions are well 
understood to be wrong. Einstein's argument itself is inconclusive, since he only shows that stable spinning 
objects have to spin faster and faster to stay stable before the point where they collapse. But it is well understood 
today (and was understood well by some even then) that collapse cannot happen through stationary states the way 
Einstein imagined. Nevertheless, the extent to which the models of black holes in classical general relativity 
correspond to physical reality remains unclear, and in particular the implications of the central singularity implicit 
in these models are still not understood. Efforts to conclusively prove the existence of event horizons have still 
not been successful. 

• Closely related to his rejection of black holes, Einstein believed that the exclusion of singularities might restrict 
the class of solutions of the field equations so as to force solutions compatible with quantum mechanics, but no 
such theory has ever been found. 

• In the early days of quantum mechanics, Einstein tried to show that the uncertainty principle was not valid, but by 
1927 he had become convinced that it was valid. 

• In the EPR paper, Einstein argued that quantum mechanics cannot be a complete realistic and local representation 
of phenomena, given specific definitions of "realism", "locality", and "completeness". The modern consensus is 
that Einstein's concept of realism is too restrictive. 

• Einstein himself considered the introduction of the cosmological term in his 1917 paper founding cosmology as a 
"blunder". The theory of general relativity predicted an expanding or contracting universe, but Einstein wanted 
a universe which is an unchanging three dimensional sphere, like the surface of a three dimensional ball in four 
dimensions. He wanted this for philosophical reasons, so as to incorporate Mach's principle in a reasonable way. 
He stabilized his solution by introducing a cosmological constant, and when the universe was shown to be 
expanding, he retracted the constant as a blunder. This is not really much of a blunder - the cosmological constant 
is necessary within general relativity as it is currently understood, and it is widely believed to have a nonzero 
value today. 

• Einstein did not immediately appreciate the value of Minkowski's four-dimensional formulation of special 
relativity, although within a few years he had adopted it as the basis for his theory of gravitation. 

• Finding it too formal, Einstein believed that Heisenberg's matrix mechanics was incorrect. He changed his mind 
when Schrodinger and others demonstrated that the formulation in terms of the Schrodinger equation, based on 
Einstein's wave-particle duality was equivalent to Heisenberg's matrices. 
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Collaboration with other scientists 

In addition to long time collaborators Leopold Infeld, Nathan Rosen, Peter Bergmann and others, Einstein also had 
some one-shot collaborations with various scientists. 

Einstein-de Haas experiment 

Einstein and De Haas demonstrated that magnetization is due to the motion of electrons, nowadays known to be the 
spin. In order to show this, they reversed the magnetization in an iron bar suspended on a torsion pendulum. They 
confirmed that this leads the bar to rotate, because the electron's angular momentum changes as the magnetization 
changes. This experiment needed to be sensitive, because the angular momentum associated with electrons is small, 
but it definitively established that electron motion of some kind is responsible for magnetization. 

Schrodinger gas model 

Einstein suggested to Erwin Schrodinger that he might be able to reproduce the statistics of a Bose-Einstein gas by 
considering a box. Then to each possible quantum motion of a particle in a box associate an independent harmonic 
oscillator. Quantizing these oscillators, each level will have an integer occupation number, which will be the number 
of particles in it. 

This formulation is a form of second quantization, but it predates modern quantum mechanics. Erwin Schrodinger 
applied this to derive the thermodynamic properties of a semiclassical ideal gas. Schrodinger urged Einstein to add 
his name as co-author, although Einstein declined the invitation. 1 

Einstein refrigerator 

In 1926, Einstein and his former student Leo Szilard co-invented (and in 1930, patented) the Einstein refrigerator. 
This absorption refrigerator was then revolutionary for having no moving parts and using only heat as an input. 
On 11 November 1930, U.S. Patent 1781541 t94] was awarded to Albert Einstein and Leo Szilard for the refrigerator. 
Their invention was not immediately put into commercial production, as the most promising of their patents were 
quickly bought up by the Swedish company Electrolux to protect its refrigeration technology from competition. 1 

Bohr versus Einstein 

In the 1920s, quantum mechanics developed into a more complete theory. 
Einstein was unhappy with the Copenhagen interpretation of quantum theory 
developed by Niels Bohr and Werner Heisenberg, both in its outcomes and its 
instrumentalist methodology, Einstein being a scientific realist. In this 
interpretation, quantum phenomena are inherently probabilistic, with definite 
states resulting only upon interaction with classical systems. A public debate 
between Einstein and Bohr followed, lasting on and off for many years 
(including during the Solvay Conferences). Einstein formulated thought 
experiments against the Copenhagen interpretation, which were all rebutted by 
Bohr. In a 1926 letter to Max Born, Einstein wrote: "I, at any rate, am convinced 
that He [God] does not throw dice." [96] 

Einstein was never satisfied by what he perceived to be quantum theory's 
intrinsically incomplete description of nature, and in 1935 he further explored the 
issue in collaboration with Boris Podolsky and Nathan Rosen, noting that the 

theory seems to require non-local interactions; this is known as the EPR paradox. The EPR experiment has since 
been performed, with results confirming quantum theory's predictions. t98] Repercussions of the Einstein-Bohr 
debate have found their way into philosophical discourse. 




Einstein and Niels Bohr, 1925 
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Einstein-Podolsky-Rosen paradox 

In 1935, Einstein returned to the question of quantum mechanics. He considered how a measurement on one of two 
entangled particles would affect the other. He noted, along with his collaborators, that by performing different 
measurements on the distant particle, either of position or momentum, different properties of the entangled partner 
could be discovered without disturbing it in any way. 

He then used a hypothesis of local realism to conclude that the other particle had these properties already 
determined. The principle he proposed is that if it is possible to determine what the answer to a position or 
momentum measurement would be, without in any way disturbing the particle, then the particle actually has values 
of position or momentum. 

This principle distilled the essence of Einstein's objection to quantum mechanics. As a physical principle, it has since 
been shown to be incompatible with experiments. 



Political views 



Einstein flouted the ascendant Nazi movement and later tried to be a 
voice of moderation in the tumultuous formation of the State of 

mm 

Israel. Fred Jerome in his Einstein on Israel and Zionism: His 
Provocative Ideas About the Middle East argues that Einstein was a 
Cultural Zionist who supported the idea of a Jewish homeland but 
opposed the establishment of a Jewish state in Palestine "with borders, 
an army, and a measure of temporal power." Instead, he preferred a 
bi-national state with "continuously functioning, mixed, administrative, 
economic, and social organizations."^ 100 ^ ^ 101 ^ However Ami Isseroff in 
his article Was Einstein a Zionist, argues that Einstein supported the 
recognition of the State of Israel and declared it "the fulfillment of our 
dream" when President Harry Truman recognize Israel in May 1948 
and in presidential election 1948 Einstein supported Henry A. 
Wallace's Progressive Party which advocate pro-Soviet and pro-Israel 
foreign policy 




[102] [103] 



Albert Einstein, seen here with his wife Elsa 
Einstein and Zionist leaders, including future 
President of Israel Chaim Weizmann, his wife Dr. 

Vera Weizmann, Menahem Ussishkin, and 
Ben-Zion Mossinson on arrival in New York City 
in 1921. 



Throughout the November Revolution in Germany Einstein signed an appeal for the foundation of a nationwide 
liberal and democratic partyj 104 ^ ^ 105 ^ which was published in the Berliner Tageblatt on 16 November 1918^ 106 ^ and 
became a member of the German Democratic Party. 11 071 

In his article Why Socialism?,^ 108 ^ published in 1949 in the Monthly Review, Einstein described a chaotic capitalist 
society, a source of evil to be overcome, as the "predatory phase of human development". He came to the following 
conclusion: 

I am convinced there is only one way to eliminate these grave evils [capitalism], namely through the 
establishment of a socialist economy, accompanied by an educational system which would be oriented 
toward social goals. In such an economy, the means of production are owned by society itself and are 
utilized in a planned fashion. A planned economy, which adjusts production to the needs of the 
community, would distribute the work to be done among all those able to work and would guarantee a 
livelihood to every man, woman, and child. The education of the individual, in addition to promoting his 
own innate abilities, would attempt to develop in him a sense of responsibility for his fellow men in 
place of the glorification of power and success in our present society. tl08] 

He braved anti-communist politics and resistance to the civil rights movement in the United States. On the floor of 
the US Congress, Einstein was accused by John E. Rankin of Mississippi of being a "foreign-born agitator" who 
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sought "to further the spread of Communism throughout the world". He also participated in the 1927 congress of 
the League against Imperialism in Brussels J 1 10] 

After World War II, as enmity between the former allies became a serious issue, Einstein wrote, "I do not know how 
the third World War will be fought, but I can tell you what they will use in the Fourth - rocks!" [111] (Einstein 1949) 
With Albert Schweitzer and Bertrand Russell, Einstein lobbied to stop nuclear testing and future bombs. Days before 
his death, Einstein signed the Russell-Einstein Manifesto, which led to the Pugwash Conferences on Science and 
World Affairs. [112] 

Einstein was a member of several civil rights groups, including the Princeton chapter of the NAACP. When the aged 
W. E. B. Du Bois was accused of being a Communist spy, Einstein volunteered as a character witness, and the case 
was dismissed shortly afterward. Einstein's friendship with activist Paul Robeson, with whom he served as co-chair 
of the American Crusade to End Lynching, lasted twenty years P 

Einstein said "Politics is for the moment, equation for the eternity. M ^ 114 ^ He declined the presidency of Israel in 
1952. [115] 



Religious views 

The question of scientific determinism gave rise to questions about Einstein's position on theological determinism, 
and whether or not he believed in God, or in a god. He once said: 

You may call me an agnostic... I do not share the crusading spirit of the professional atheist whose fervor is 
mostly due to a painful act of liberation from the fetters of religious indoctrination received in youth. I prefer 
an attitude of humility corresponding to the weakness of our intellectual understanding of nature and of our 
own being. 



Non-scientific legacy 

While travelling, Einstein wrote daily to his wife Elsa and adopted stepdaughters Margot and Use. The letters were 
included in the papers bequeathed to The Hebrew University. Margot Einstein permitted the personal letters to be 
made available to the public, but requested that it not be done until twenty years after her death (she died in 1986^ 
). Barbara Wolff, of The Hebrew University's Albert Einstein Archives, told the BBC that there are about 3,500 
pages of private correspondence written between 1912 and 1955 P 

Einstein bequeathed the royalties from use of his image to The Hebrew University of Jerusalem. Corbis, successor to 

The Roger Richman Agency, licenses the use of his name and associated imagery, as agent for the university/ 1 19] 
[120] 



In popular culture 

In the period before World War II, Einstein was so well-known in America that he would be stopped on the street by 
people wanting him to explain "that theory". He finally figured out a way to handle the incessant inquiries. He told 
his inquirers "Pardon me, sorry! Always I am mistaken for Professor Einstein."^ 121 ^ 

[1221 

Einstein has been the subject of or inspiration for many novels, films, plays, and works of music. He is a favorite 
model for depictions of mad scientists and absent-minded professors; his expressive face and distinctive hairstyle 
have been widely copied and exaggerated. Time magazine's Frederic Golden wrote that Einstein was "a cartoonist's 
dream come true" J 123] 
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Awards and honors 

In 1922, Einstein was awarded the 1921 Nobel Prize in PhysicsJ 124 ^ 
"for his services to Theoretical Physics, and especially for his 
discovery of the law of the photoelectric effect". This refers to his 1905 
paper on the photoelectric effect, "On a Heuristic Viewpoint 
Concerning the Production and Transformation of Light", which was 
well supported by the experimental evidence by that time. The 
presentation speech began by mentioning "his theory of relativity 
[which had] been the subject of lively debate in philosophical circles 
[and] also has astrophysical implications which are being rigorously 
examined at the present time". (Einstein 1923) 

It was long reported that Einstein gave the Nobel prize money to his 
first wife, Mileva Marie, in compliance with their 1919 divorce 
settlement. However, personal correspondence made public in 
2006^ 125 ^ shows that he invested much of it in the United States, and 
saw much of it wiped out in the Great Depression. 




In 1999 Albert Einstein was named the Person of 
the Century. 



In 1929, Max Planck presented Einstein with the Max Planck medal of 
the German Physical Society in Berlin, for extraordinary achievements 
in theoretical physics J 126 ^ 

In 1936, Einstein was awarded the Franklin Institute's Franklin Medal for his extensive work on relativity and the 
photo-electric effect. 



[126] 



The International Union of Pure and Applied Physics named 2005 the "World Year of Physics" in commemoration 

n 27] 

of the 100th anniversary of the publication of the annus mirabilis papers. 

The Albert Einstein Science Park is located on the hill Telegrafenberg in Potsdam, Germany. The best known 
building in the park is the Einstein Tower which has a bronze bust of Einstein at the entrance. The Tower is an 
astrophysical observatory that was built to perform checks of Einstein's theory of General Relativity. 

The Albert Einstein Memorial in central Washington, D.C. is a monumental bronze statue depicting Einstein seated 
with manuscript papers in hand. The statue, commissioned in 1979, is located in a grove of trees at the southwest 
corner of the grounds of the National Academy of Sciences on Constitution Avenue. 

The chemical element 99, einsteinium, was named for him in August 1955, four months after Einstein's deathJ 129 ^ 
[130] 1 Ei ns tein is an inner main belt asteroid discovered on 5 March 1973 P 3 ^ 

[1231 r 1 321 

In 1999 Time magazine named him the Person of the Century, 1 ahead of Mahatma Gandhi and Franklin 

Roosevelt, among others. In the words of a biographer, "to the scientifically literate and the public at large, Einstein 

[1331 

is synonymous with genius". J Also in 1999, an opinion poll of 100 leading physicists ranked Einstein the 
"greatest physicist ever" J 1 34 ^ A Gallup poll recorded him as the fourth most admired person of the 20th century in 
the U.S. [135] 

In 1990, his name was added to the Walhalla temple for "laudable and distinguished Germans "J 136 ^ which is located 

[1371 

east of Regensburg, in Bavaria, Germany. 1 

The United States Postal Service honored Einstein with a Prominent Americans series (1965-1978) 80 postage 
stamp. 
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Awards named after him 

The Albert Einstein Award (sometimes called the Albert Einstein Medal because it is accompanied with a gold 
medal) is an award in theoretical physics, established to recognize high achievement in the natural sciences. It was 
endowed by the Lewis and Rosa Strauss Memorial Fund in honor of Albert Einstein's 70th birthday. It was first 
awarded in 1951 and included a prize money of $ 15,000, [138] [139] which was later reduced to $ 5,000. [140] [141] The 
winner is selected by a committee (the first of which consisted of Einstein, Oppenheimer, von Neumann and 
Weyl^ 142 ^ ) of the Institute for Advanced Study, which administers the award J 139 ^ 

The Albert Einstein Medal is an award presented by the Albert Einstein Society in Bern, Switzerland. First given in 
1979, the award is presented to people who have "rendered outstanding services" in connection with Einstein J 143 ^ 

The Albert Einstein Peace Prize is given yearly by the Chicago, Illinois-based Albert Einstein Peace Prize 
Foundation. Winners of the prize receive $50,000 J 144] 



Publications 

The following publications by Albert Einstein are referenced in this article. A more complete list of his 
publications may be found at List of scientific publications by Albert Einstein. 

• Einstein, Albert (1901), "Folgerungen aus den Capillaritatserscheinungen (Conclusions Drawn from the 
Phenomena of Capillarity)", Annalen der Physik 4: 513, doi: 10. 1002/andp. 1901 3090306 

• Einstein, Albert (1905a), "On a Heuristic Viewpoint Concerning the Production and Transformation of Light" 
^ 145 ^, Annalen der Physik 17: 132-148 . This annus mirabilis paper on the photoelectric effect was received by 
Annalen der Physik 18 th March. 

• Einstein, Albert (1905b), A new determination of molecular dimensions. This PhD thesis was completed 30 th 
April and submitted 20 th July. 

• Einstein, Albert (1905c), "On the Motion - Required by the Molecular Kinetic Theory of Heat - of Small 
Particles Suspended in a Stationary Liquid", Annalen der Physik 17: 549-560. This annus mirabilis paper on 
Brownian motion was received 1 1 th May. 

• Einstein, Albert (1905d), "On the Electrodynamics of Moving Bodies", Annalen der Physik 17: 891-921. This 
annus mirabilis paper on special relativity was received 30 th June. 

• Einstein, Albert (1905e), "Does the Inertia of a Body Depend Upon Its Energy Content?", Annalen der Physik 18: 
639-641. This annus mirabilis paper on mass-energy equivalence was received 27 th September. 

• Einstein, Albert (1915), "Die Feldgleichungen der Gravitation (The Field Equations of Gravitation)", Koniglich 
Preussische Akademie der Wissenschaften: 844-847 

• Einstein, Albert (1917a), "Kosmologische Betrachtungen zur allgemeinen Relativitatstheorie (Cosmological 
Considerations in the General Theory of Relativity)", Koniglich Preussische Akademie der Wissenschaften 

• Einstein, Albert (1917b), "Zur Quantentheorie der Strahlung (On the Quantum Mechanics of Radiation)", 
Physikalische Zeitschrift 18: 121-128 

• Einstein, Albert (11 July 1923), "Fundamental Ideas and Problems of the Theory of Relativity" ^ U6 \ Nobel 
Lectures, Physics 1901—1921, Amsterdam: Elsevier Publishing Company, retrieved 25 March 2007 

• Einstein, Albert (1924), "Quantentheorie des einatomigen idealen Gases (Quantum theory of monatomic ideal 
gases)", Sitzungsberichte der P reus sichen Akademie der Wissenschaften Physikalisch-Mathematische Klasse: 
261—267. First of a series of papers on this topic. 

• Einstein, Albert (1926), "Die Ursache der Maanderbildung der Flusslaufe und des sogenannten Baerschen 
Gesetzes", Die Naturwissenschaften 14: 223-224, doi: 10. 1007/BF015 10300. On Baer's law and meanders in the 
courses of rivers. 

• Einstein, Albert; Podolsky, Boris; Rosen, Nathan (15 May 1935), "Can Quantum-Mechanical Description of 
Physical Reality Be Considered Complete?", Physical Review 47 (10): 777-780, doi:10.1103/PhysRev.47.777 
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• Einstein, Albert (1940), "On Science and Religion", Nature (Edinburgh: Scottish Academic) 146: 605, 
doi:10.1038/146605a0, ISBN 0707304539 

• Einstein, Albert et al (4 December 1948), "To the editors" [147] , New York Times (Melville, NY: AIP, American 
Inst, of Physics), ISBN 0735403597 

• Einstein, Albert (May 1949), "Why Socialism?" [148] , Monthly Review, retrieved 16 January 2006 

• Einstein, Albert (1950), "On the Generalized Theory of Gravitation", Scientific American CLXXXII (4): 13-17 

• Einstein, Albert (1954), Ideas and Opinions, New York: Random House, ISBN 0-517-00393-7 

• Einstein, Albert (1969) (in German), Albert Einstein, Hedwig undMax Born: Briefwechsel 1 916-1955, Munich: 
Nymphenburger Verlagshandlung, ISBN 388682005X 

• Einstein, Albert (1979), Autobiographical Notes, Paul Arthur Schilpp (Centennial ed.), Chicago: Open Court, 
ISBN 0-875-48352-6. The chasing a light beam thought experiment is described on pages 48-51. 

• Collected Papers: Stachel, John, Martin J. Klein, a. J. Kox, Michel Janssen, R. Schulmann, Diana Komos 
Buchwald and others (Eds.) (1987-2006), The Collected Papers of Albert Einstein, Vol. I— 10 ^ l49 \ Princeton 
University Press Further information about the volumes published so far can be found on the webpages of the 
Einstein Papers Project ^ 150 ^ and on the Princeton University Press Einstein Page ^ 151 ^ 
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Erwin Rudolf Josef Alexander Schrodinger (German 
pronunciation: ['8Kvi:n 'J^'clirjB]; 12 August 1887- 4 January 1961) 
was an Austrian theoretical physicist who was one of the fathers of 
quantum mechanics, and is famed for a number of important 
contributions to physics, especially the Schrodinger equation, for 
which he received the Nobel Prize in Physics in 1933. In 1935, after 
extensive correspondence with personal friend Albert Einstein, he 
proposed the Schrodinger's cat thought experiment. 

Biography 

Early years 

In 1887 Schrodinger was born in Vienna, Austria to Rudolf 
Schrodinger (cerecloth producer, botanist) and Georgine Emilia 
Brenda (daughter of Alexander Bauer, Professor of Chemistry, k.u.k. 
Technische Hochschule Vienna). 




Bust of Schrodinger, in the courtyard arcade of 
the main building, University of Vienna, Austria. 



His mother was half Austrian and half English; the English side of her family came from Leamington Spa. 
Schrodinger learned English and German almost at the same time due to the fact that both were spoken in the family 
household. His father was a Catholic and his mother was a Lutheran. 

In 1898 he attended the Akademisches Gymnasium. Between 1906 and 1910 Schrodinger studied in Vienna under 
Franz Serafin Exner (1849—1926) and Friedrich Hasenohrl (1874—1915). He also conducted experimental work with 
Karl Wilhelm Friedrich ("Fritz") Kohlrausch (1884— 1953). ^ ^ In 1911 Schrodinger became an assistant to Exner. 
At an early age, Schrodinger was strongly influenced by Schopenhauer. As a result of his extensive reading of 
Schopenhauer's works, he became deeply interested throughout his life in color theory, philosophy, ^ perception, 
and eastern religion, especially Hindu Vedanta. 



Middle years 

In 1914 Erwin Schrodinger achieved Habilitation (venia legendi). Between 1914 and 1918 he participated in war 
work as a commissioned officer in the Austrian fortress artillery (Gorizia, Duino, Sistiana, Prosecco, Vienna). On 6 
April 1920, Schrodinger married Annemarie Bertel. The same year, he became the assistant to Max Wien, in Jena, 
and in September 1920 he attained the position of ao. Prof. (Ausserordentlicher Professor), roughly equivalent to 
Reader (UK) or associate professor (US), in Stuttgart. In 1921, he became o. Prof. (Ordentlicher Professor, i.e. full 
professor), in Breslau (now Wroclaw, Poland). 

In 1921, he moved to the University of Zurich. In January 1926, Schrodinger published in Annalen der Physik the 
paper " Quantisierung als Eigenwertproblern" [tr. Quantization as an Eigenvalue Problem] on wave mechanics and 
what is now known as the Schrodinger equation. In this paper he gave a "derivation" of the wave equation for time 
independent systems, and showed that it gave the correct energy eigenvalues for the hydrogen-like atom. This paper 
has been universally celebrated as one of the most important achievements of the twentieth century, and created a 
revolution in quantum mechanics, and indeed of all physics and chemistry. A second paper was submitted just four 
weeks later that solved the quantum harmonic oscillator, the rigid rotor and the diatomic molecule, and gives a new 
derivation of the Schrodinger equation. A third paper in May showed the equivalence of his approach to that of 
Heisenberg and gave the treatment of the Stark effect. A fourth paper in this most remarkable series showed how to 
treat problems in which the system changes with time, as in scattering problems. These papers were the central 
achievement of his career and were at once recognized as having great significance by the physics community. 
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In 1927, he succeeded Max Planck at the Friedrich Wilhelm University in Berlin. In 1933, however, Schrodinger 
decided to leave Germany; he disliked the Nazis' anti-semitism. He became a Fellow of Magdalen College at the 
University of Oxford. Soon after he arrived, he received the Nobel Prize together with Paul Adrien Maurice Dirac. 
His position at Oxford did not work out; his unconventional personal life (Schrodinger lived with two women/ 5 ^ was 
not met with acceptance. In 1934, Schrodinger lectured at Princeton University; he was offered a permanent position 
there, but did not accept it. Again, his wish to set up house with his wife and his mistress may have posed a problem. 
He had the prospect of a position at the University of Edinburgh but visa delays occurred, and in the end he took up a 
position at the University of Graz in Austria in 1936. 

In the midst of these tenure issues in 1935, after extensive correspondence with personal friend Albert Einstein, he 
proposed the Schrodinger's cat thought experiment. 





Erwin Schrodinger in 1933 



Later years 

In 1939, after the Anschluss, Schrodinger had problems because of his 
flight from Germany in 1933 and his known opposition to Nazism. He 
issued a statement recanting this opposition (he later regretted doing 
so, and he personally apologized to Einstein). However, this did not 
fully appease the new dispensation and the university dismissed him 
from his job for political unreliability. He suffered harassment and 
received instructions not to leave the country, but he and his wife fled 
to Italy. From there he went to visiting positions in Oxford and Ghent 
Universities. 

In 1940 he received a personal invitation from Ireland's Taoiseach 
Eamon de Valera to reside in Ireland and agree to help establish an 
Institute for Advanced Studies in Dublin. He moved to Clontarf, 
Dublin and became the Director of the School for Theoretical Physics 
and remained there for 17 years, during which time he became a 
naturalized Irish citizen. He wrote about 50 further publications on 
various topics, including his explorations of unified field theory. 

In 1944, he wrote What is Life?, which contains a discussion of negentropy and the concept of a complex molecule 
with the genetic code for living organisms. According to James D. Watson's memoir, DNA, the Secret of Life, 
Schrodinger's book gave Watson the inspiration to research the gene, which led to the discovery of the DNA double 
helix structure. Similarly, Francis Crick, in his autobiographical book What Mad Pursuit, described how he was 
influenced by Schrodinger's speculations about how genetic information might be stored in molecules. However, the 
geneticist and 1946 Nobel-prize winner H.J. Muller had in his 1922 article "Variation due to Change in the 
Individual Gene"^ already laid out all the basic properties of the heredity molecule that Schrodinger derives from 
first principles in What is Life?, properties which Muller refined in his 1929 article "The Gene As The Basis of 

T71 T81 

Life" and further clarified during the 1930s, long before the publication of What is Life?. 

Schrodinger stayed in Dublin until retiring in 1955. During this time he remained committed to his particular 
passion; involvements with students occurred and he fathered two children by two different Irish women . He had a 
life-long interest in the Vedanta philosophy of Hinduism, which influenced his speculations at the close of What is 
Life? about the possibility that individual consciousness is only a manifestation of a unitary consciousness pervading 

rm 

the universe. 

In 1956, he returned to Vienna (chair ad personam). At an important lecture during the World Energy Conference he 
refused to speak on nuclear energy because of his skepticism about it and gave a philosophical lecture instead. 
During this period Schrodinger turned from mainstream quantum mechanics' definition of wave-particle duality and 
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promoted the wave idea alone causing much controversy 
Personal life 

Schrodinger suffered from tuberculosis and several times in the 1920s stayed at a sanatorium in Arosa. It was there 
that he discovered his wave equation J 10 ^ 

Schrodinger decided in 1933 that he could not live in a country in which persecution of Jews had become a national 
policy. Alexander Frederick Lindemann, the head of physics at Oxford University, visited Germany in the spring of 
1933 to try to arrange positions in England for some young Jewish scientists from Germany. He spoke to 
Schrodinger about posts for one of his assistants and was surprised to discover that Schrodinger himself was 
interested in leaving Germany. Schrodinger asked for a colleague, Arthur March, to be offered a post as his assistant. 

The request for March stemmed from Schrodinger's unconventional relationships with women: although his relations 
with his wife Anny were good, he had had many lovers with his wife's full knowledge (and in fact, Anny had her 
own lover, Hermann Weyl). Schrodinger asked for March to be his assistant because, at that time, he was in love 
with March's wife Hilde. 

Many of the scientists who had left Germany spent mid- 1933 in the Italian province of Bolzano. Here Hilde became 
pregnant with Schrodinger's child. On 4 November 1933 Schrodinger, his wife and Hilde March arrived in Oxford. 
Schrodinger had been elected a fellow of Magdalen College. Soon after they arrived in Oxford, Schrodinger heard 
that, for his work on wave mechanics, he had been awarded the Nobel prize. 

In early 1934 Schrodinger was invited to lecture at Princeton University and while there he was made an offer of a 
permanent position. On his return to Oxford he negotiated about salary and pension conditions at Princeton but in the 
end he did not accept. It is thought that the fact that he wished to live at Princeton with Anny and Hilde both sharing 
the upbringing of his child was not found acceptable. The fact that Schrodinger openly had two wives, even if one of 
them was married to another man, was not well received in Oxford either. Nevertheless, his daughter Ruth Georgie 
Erica was born there on 30 May 1934.^^ 

On 4 January 1961, Schrodinger died in Vienna at the age of 73 of tuberculosis. 
He left a widow, Anny (born Annemarie Bertel on 3 December 1896, died 3 
October 1965), and was buried in Alpbach, Austria. 

Legacy 

The philosophical issues raised by Schrodinger's cat are still debated today and 
remains his most enduring legacy in popular science, while Schrodinger's 
equation is his most enduring legacy at a more technical level. The huge crater 
Schrodinger, on the far side of the Moon is named after him. The Erwin 
Schrodinger International Institute for Mathematical Physics was established in 
Vienna in 1993. 

Color 

One of Schrodinger's lesser-known areas of scientific contribution was his work 
on color, color perception, and colorimetry {Farbenmetrik). In 1920, he 
published three papers in this area: 

Erwin Schrodinger's gravesite 

• "Theorie der Pigmente von groBter Leuchtkraft," Annalen der Physik, (4), 62, 
(1920), 603-622 

• "Grundlinien einer Theorie der Farbenmetrik im Tagessehen," Annalen der Physik, (4), 63, (1920), 397-426; 
427-456; 481-520 (Outline of a theory of color measurement for daylight vision) 
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• "Farbenmetrik," Zeitschrift fiir Physik, 1, (1920), 459-466 (Color measurement). 

The second of these is available in English as "Outline of a Theory of Color Measurement for Daylight Vision" in 
Sources of Color Science, Ed. David L. Mac Adam, The MIT Press (1970), 134-182. 
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John von Neumann 



The native form of this personal name is Neumann Jdnos. This article uses the Western name order. 
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John von Neumann (English pronunciation: /vDn "n0im9n/) (December 28, 1903 - February 8, 1957) was a 
Hungarian-born American mathematician who made major contributions to a vast range of fields } ^ including set 
theory, functional analysis, quantum mechanics, ergodic theory, continuous geometry, economics and game theory, 
computer science, numerical analysis, hydrodynamics (of explosions), and statistics, as well as many other 
mathematical fields. He is generally regarded as one of the greatest mathematicians in modern history. 1 J The 
mathematician Jean Dieudonne called von Neumann "the last of the great mathematicians", while Peter Lax 
described him as possessing the most "fearsome technical prowess" and "scintillating intellect" of the century J 4 ^ 
Even in Budapest, in the time that produced geniuses like Theodore von Karman (b. 1881), Leo Szilard (b. 1898), 
Eugene Wigner (b. 1902), and Edward Teller (b. 1908), his brilliance stood out.^ 

Von Neumann was a pioneer of the application of operator theory to quantum mechanics, in the development of 
functional analysis, a principal member of the Manhattan Project and the Institute for Advanced Study in Princeton 
(as one of the few originally appointed), and a key figure in the development of game theory ^ ^ and the concepts 
of cellular automata^ and the universal constructor. Along with Teller and Stanislaw Ulam, von Neumann worked 
out key steps in the nuclear physics involved in thermonuclear reactions and the hydrogen bomb. 



Biography 

The eldest of three brothers, von Neumann was born Neumann Janos Lajos (Hungarian pronunciation: ['nojmDn ja:noJ 
lDjoj]; in Hungarian the family name comes first) on December 28, 1903 in Budapest, Austro-Hungarian Empire, to 
somewhat wealthy Jewish parents ^ ^ His father was Neumann Miksa (Max Neumann), a lawyer who worked in 
a bank. His mother was Kann Margit (Margaret Kann). 

Janos, nicknamed "Jancsi" (Johnny), was a child prodigy who showed an aptitude for languages, memorization, and 
mathematics. By the age of six, he could exchange jokes in Classical Greek, memorize telephone directories, and 
display prodigious mental calculation abilities J 10 ^ He entered the Hungarian-speaking Lutheran high school Fasori 
Evangelikus Gimnazium in Budapest in 1911. Although he attended school at the grade level appropriate to his age, 
his father hired private tutors to give him advanced instruction in those areas in which he had displayed an aptitude. 
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Recognized as a mathematical prodigy, at the age of 15 he began to study under Gabor Szego. On their first meeting, 
Szego was so impressed with the boy's mathematical talent that he was brought to tears J 1 ^ In 1913, his father was 
rewarded with ennoblement for his service to the Austro-Hungarian empire. (After becoming semi-autonomous in 
1867, Hungary had found itself in need of a vibrant mercantile class.) The Neumann family thus acquiring the title 
margittai, Neumann Janos became margittai Neumann Janos (John Neumann of Margitta), which he later changed to 
the German Johann von Neumann. He received his Ph.D. in mathematics (with minors in experimental physics and 
chemistry) from Pazmany Peter University in Budapest at the age of 22. ^ He simultaneously earned his diploma in 
chemical engineering from the ETH Zurich in Switzerland^ at the behest of his father, who wanted his son to invest 
his time in a more financially viable endeavour than mathematics. Between 1926 and 1930, he taught as a 
Privatdozent at the University of Berlin, the youngest in its history. By age 25, he had published ten major papers, 
and by 30, nearly 36. 

His father, Max von Neumann died in 1929. In 1930, von Neumann, his mother, and his brothers emigrated to the 
United States. He anglicized his first name to John, keeping the Austrian-aristocratic surname of von Neumann, 
whereas his brothers adopted surnames Vonneumann and Neumann (using the de Neumann form briefly when first 
in the U.S.). 

Von Neumann was invited to Princeton University, New Jersey in 1930, and, subsequently, was one of the first four 
people selected for the faculty of the Institute for Advanced Study (two of the others being Albert Einstein and Kurt 
Godel), where he remained a mathematics professor from its formation in 1933 until his death. 

In 1937, von Neumann became a naturalized citizen of the U.S. In 1938, von Neumann was awarded the Bocher 
Memorial Prize for his work in analysis. 

Von Neumann married twice. He married Mariette Kovesi in 1930, just prior to emigrating to the United States. 
They had one daughter (von Neumann's only child), Marina, who is now a distinguished professor of international 
trade and public policy at the University of Michigan. The couple divorced in 1937. In 1938, von Neumann married 
Klara Dan, whom he had met during his last trips back to Budapest prior to the outbreak of World War II. The von 
Neumanns were very active socially within the Princeton academic community, and it is from this aspect of his life 
that many of the anecdotes which surround von Neumann's legend originate. 

In 1955, von Neumann was diagnosed with what was either bone 
ri2i 

or pancreatic cancer. Von Neumann died a year and a half later, 

in great pain. While at Walter Reed Hospital in Washington, D.C., 

he invited a Roman Catholic priest, Father Anselm Strittmatter, 

O.S.B., to visit him for consultation (a move which shocked some 

ri3i 

of von Neumann's friends). The priest then administered to him 
the last Sacraments J 14 ^ He died under military security lest he 
reveal military secrets while heavily medicated. John von 
Neumann was buried at Princeton Cemetery in Princeton, Mercer 
Gravestone of John von Neumann County, New Jersey . [15] 

Von Neumann wrote 150 published papers in his life; 60 in pure 
mathematics, 20 in physics, and 60 in applied mathematics. His last work, written while in the hospital and later 
published in book form as The Computer and the Brain, gives an indication of the direction of his interests at the 
time of his death. 
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Logic and set theory 

The axiomatization of mathematics, on the model of Euclid's Elements, had reached new levels of rigor and breadth 
at the end of the 19th century, particularly in arithmetic (thanks to Richard Dedekind and Giuseppe Peano) and 
geometry (thanks to David Hilbert). At the beginning of the twentieth century, set theory, the new branch of 
mathematics discovered by Georg Cantor, and thrown into crisis by Bertrand Russell with the discovery of his 
famous paradox (on the set of all sets which do not belong to themselves), had not yet been formalized. The problem 
of an adequate axiomatization of set theory was resolved implicitly about twenty years later (by Ernst Zermelo and 
Abraham Fraenkel) by way of a series of principles which allowed for the construction of all sets used in the actual 
practice of mathematics, but which did not explicitly exclude the possibility of the existence of sets which belong to 
themselves. In his doctoral thesis of 1925, von Neumann demonstrated how it was possible to exclude this possibility 
in two complementary ways: the axiom of foundation and the notion of class. 

The axiom of foundation established that every set can be constructed from the bottom up in an ordered succession 
of steps by way of the principles of Zermelo and Fraenkel, in such a manner that if one set belongs to another then 
the first must necessarily come before the second in the succession (hence excluding the possibility of a set 
belonging to itself.) To demonstrate that the addition of this new axiom to the others did not produce contradictions, 
von Neumann introduced a method of demonstration (called the method of inner models) which later became an 
essential instrument in set theory. 

The second approach to the problem took as its base the notion of class, and defines a set as a class which belongs to 
other classes, while a proper class is defined as a class which does not belong to other classes. Under the 
Zermelo/Fraenkel approach, the axioms impede the construction of a set of all sets which do not belong to 
themselves. In contrast, under the von Neumann approach, the class of all sets which do not belong to themselves 
can be constructed, but it is a proper class and not a set. 

With this contribution of von Neumann, the axiomatic system of the theory of sets became fully satisfactory, and the 
next question was whether or not it was also definitive, and not subject to improvement. A strongly negative answer 
arrived in September 1930 at the historic mathematical Congress of Konigsberg, in which Kurt Godel announced his 
first theorem of incompleteness: the usual axiomatic systems are incomplete, in the sense that they cannot prove 
every truth which is expressible in their language. This result was sufficiently innovative as to confound the majority 
of mathematicians of the time. But von Neumann, who had participated at the Congress, confirmed his fame as an 
instantaneous thinker, and in less than a month was able to communicate to Godel himself an interesting 
consequence of his theorem: namely that the usual axiomatic systems are unable to demonstrate their own 
consistency. It is precisely this consequence which has attracted the most attention, even if Godel originally 
considered it only a curiosity, and had derived it independently anyway (it is for this reason that the result is called 
Godel's second theorem, without mention of von Neumann.) 

Quantum mechanics 

At the International Congress of Mathematicians of 1900, David Hilbert presented his famous list of twenty-three 
problems considered central for the development of the mathematics of the new century. The sixth of these was the 
axiomatization of physical theories. Among the new physical theories of the century the only one which had yet to 
receive such a treatment by the end of the 1930s was quantum mechanics. Quantum mechanics found itself in a 
condition of foundational crisis similar to that of set theory at the beginning of the century, facing problems of both 
philosophical and technical natures. On the one hand, its apparent non-determinism had not been reduced to an 
explanation of a deterministic form. On the other, there still existed two independent but equivalent heuristic 
formulations, the so-called matrix mechanical formulation due to Werner Heisenberg and the wave mechanical 
formulation due to Erwin Schrodinger, but there was not yet a single, unified satisfactory theoretical formulation. 
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After having completed the axiomatization of set theory, von Neumann began to confront the axiomatization of 
quantum mechanics. He immediately realized, in 1926, that a quantum system could be considered as a point in a 
so-called Hilbert space, analogous to the 6N dimension (N is the number of particles, 3 general coordinate and 3 
canonical momentum for each) phase space of classical mechanics but with infinitely many dimensions 
(corresponding to the infinitely many possible states of the system) instead: the traditional physical quantities (e.g., 
position and momentum) could therefore be represented as particular linear operators operating in these spaces. The 
physics of quantum mechanics was thereby reduced to the mathematics of the linear Hermitian operators on Hilbert 
spaces. 

For example, the famous uncertainty principle of Heisenberg, according to which the determination of the position of 
a particle prevents the determination of its momentum and vice versa, is translated into the non-commutativity of the 
two corresponding operators. This new mathematical formulation included as special cases the formulations of both 
Heisenberg and Schrodinger, and culminated in the 1932 classic The Mathematical Foundations of Quantum 
Mechanics. However, physicists generally ended up preferring another approach to that of von Neumann (which was 
considered elegant and satisfactory by mathematicians). This approach was formulated in 1930 by Paul Dirac. 

Von Neumann's abstract treatment permitted him also to confront the foundational issue of determinism vs. 
non-determinism and in the book he demonstrated a theorem according to which quantum mechanics could not 
possibly be derived by statistical approximation from a deterministic theory of the type used in classical mechanics. 
This demonstration contained a conceptual error, but it helped to inaugurate a line of research which, through the 
work of John Stuart Bell in 1964 on Bell's Theorem and the experiments of Alain Aspect in 1982, demonstrated that 
quantum physics requires a notion of reality substantially different from that of classical physics. 

Economics and game theory 

Von Neumann's first significant contribution to economics was the minimax theorem of 1928. This theorem 
establishes that in certain zero sum games with perfect information (i.e., in which players know at each time all 
moves that have taken place so far), there exists a strategy for each player which allows both players to minimize 
their maximum losses (hence the name minimax). When examining every possible strategy, a player must consider 
all the possible responses of the player's adversary and the maximum loss. The player then plays out the strategy 
which will result in the minimization of this maximum loss. Such a strategy, which minimizes the maximum loss, is 
called optimal for both players just in case their minimaxes are equal (in absolute value) and contrary (in sign). If the 
common value is zero, the game becomes pointless. 

Von Neumann eventually improved and extended the minimax theorem to include games involving imperfect 
information and games with more than two players. This work culminated in the 1944 classic Theory of Games and 
Economic Behavior (written with Oskar Morgenstern). The public interest in this work was such that The New York 
Times ran a front page story, something which only Einstein had previously elicited. 

Von Neumann's second important contribution in this area was the solution, in 1937, of a problem first described by 
Leon Walras in 1874, the existence of situations of equilibrium in mathematical models of market development 
based on supply and demand. He first recognized that such a model should be expressed through disequations and 
not equations, and then he found a solution to Walras' problem by applying a fixed-point theorem derived from the 
work of L. E. J. Brouwer. The lasting importance of the work on general equilibria and the methodology of fixed 
point theorems is underscored by the awarding of Nobel prizes in 1972 to Kenneth Arrow, in 1983 to Gerard Debreu, 
and in 1994 to John Nash who had improved von Neumann's theory in his Princeton Ph.D thesis. 

Von Neumann was also the inventor of the method of proof, used in game theory, known as backward induction 
(which he first published in 1944 in the book co-authored with Morgenstern, Theory of Games and Economic 
Behaviour) J 16 -' 
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Mathematical statistics and econometrics 

Von Neumann made some fundamental contributions to mathematical statistics. In 1941, he derived the exact 

ri7i 

distribution of the ratio of mean square successive difference to the variance for normally distributed variables. 

This ratio was applied to the residuals from regression models and is commonly known as the Durbin- Watson 
risi 

statistic for testing the null hypothesis that the errors are serially independent against the alternative that they 

ri9i 

follow a stationary first order autoregression. Subsequently, John Denis Sargan and Alok Bhargava extended the 
results for testing if the errors on a regression model follow a Gaussian random walk (i.e. possess a unit root) against 
the alternative that they are a stationary first order autoregression. Von Neumann's contributions to statistics have 
had a major impact on econometric methodology. 



Nuclear weapons 

Beginning in the late 1930s, von Neumann began to take more of an interest in 
applied (as opposed to pure) mathematics. In particular, he developed an 
expertise in explosions — phenomena which are difficult to model 
mathematically. This led him to a large number of military consultancies, 
primarily for the Navy, which in turn led to his involvement in the Manhattan 
Project. The involvement included frequent trips by train to the project's secret 
research facilities in Los Alamos, New Mexico 

Von Neumann's principal contribution to the atomic bomb itself was in the 
concept and design of the explosive lenses needed to compress the plutonium 
core of the Trinity test device and the "Fat Man" weapon that was later dropped 
on Nagasaki. While von Neumann did not originate the "implosion" concept, he 
was one of its most persistent proponents, encouraging its continued 
development against the instincts of many of his colleagues, who felt such a 
design to be unworkable. The lens shape design work was completed by July 
1944. 

In a visit to Los Alamos in September 1944, von Neumann showed that the pressure increase from explosion shock 
wave reflection from solid objects was greater than previously believed if the angle of incidence of the shock wave 
was between 90° and some limiting angle. As a result, it was determined that the effectiveness of an atomic bomb 
would be enhanced with detonation some kilometers above the target, rather than at ground level P 0 ^ 

Beginning in the spring of 1945, along with four other scientists and various military personnel, von Neumann was 
included in the target selection committee responsible for choosing the Japanese cities of Hiroshima and Nagasaki as 
the first targets of the atomic bomb. Von Neumann oversaw computations related to the expected size of the bomb 
blasts, estimated death tolls, and the distance above the ground at which the bombs should be detonated for optimum 
shock wave propagation and thus maximum effect. The cultural capital Kyoto, which had been spared the 
firebombing inflicted upon militarily significant target cities like Tokyo in World War II, was von Neumann's first 
choice, a selection seconded by Manhattan Project leader General Leslie Groves. However, this target was dismissed 
by Secretary of War Henry Stimson. 1 

On July 16, 1945, with numerous other Los Alamos personnel, von Neumann was an eyewitness to the first atomic 
bomb blast, conducted as a test of the implosion method device, 35 miles (56 km) southeast of Socorro, New 
Mexico. Based on his observation alone, von Neumann estimated the test had resulted in a blast equivalent to 5 
kilotons of TNT, but Enrico Fermi produced a more accurate estimate of 10 kilotons by dropping scraps of torn-up 
paper as the shock wave passed his location and watching how far they scattered. The actual power of the explosion 
had been between 20 and 22 kilotons J 20 ^ 




John von Neumann's wartime Los 
Alamos ID badge photo. 
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After the war, Robert Oppenheimer remarked that the physicists involved in the Manhattan project had "known sin". 
Von Neumann's response was that "sometimes someone confesses a sin in order to take credit for it." 

Von Neumann continued unperturbed in his work and became, along with Edward Teller, one of those who sustained 
the hydrogen bomb project. He then collaborated with Klaus Fuchs on further development of the bomb, and in 1946 
the two filed a secret patent on "Improvement in Methods and Means for Utilizing Nuclear Energy", which outlined 
a scheme for using a fission bomb to compress fusion fuel to initiate a thermonuclear reaction. The Fuchs- von 
Neumann patent used radiation implosion, but not in the same way as is used in what became the final hydrogen 
bomb design, the Teller-Ulam design. Their work was, however, incorporated into the "George" shot of Operation 
Greenhouse, which was instructive in testing out concepts that went into the final design. The Fuchs-von Neumann 
work was passed on, by Fuchs, to the USSR as part of his nuclear espionage, but it was not used in the Soviet's own, 
independent development of the Teller-Ulam design. The historian Jeremy Bernstein has pointed out that ironically, 
"John von Neumann and Klaus Fuchs, produced a brilliant invention in 1946 that could have changed the whole 
course of the development of the hydrogen bomb, but was not fully understood until after the bomb had been 
successfully made."^ 



Computer science 

Von Neumann's hydrogen bomb work was also played out in the realm of computing, where he and Stanislaw Ulam 
developed simulations on von Neumann's digital computers for the hydrodynamic computations. During this time he 
contributed to the development of the Monte Carlo method, which allowed complicated problems to be 
approximated using random numbers. Because using lists of "truly" random numbers was extremely slow, von 
Neumann developed a form of making pseudorandom numbers, using the middle-square method. Though this 
method has been criticized as crude, von Neumann was aware of this: he justified it as being faster than any other 
method at his disposal, and also noted that when it went awry it did so obviously, unlike methods which could be 
subtly incorrect. 

While consulting for the Moore School of Electrical Engineering at the University of Pennsylvania on the ED VAC 
project, von Neumann wrote an incomplete First Draft of a Report on the EDM AC. The paper, which was widely 
distributed, described a computer architecture in which the data and the program are both stored in the computer's 
memory in the same address space. This architecture is to this day the basis of modern computer design, unlike the 
earliest computers that were 'programmed' by altering the electronic circuitry. Although the single-memory, stored 
program architecture is commonly called von Neumann architecture as a result of von Neumann's paper, the 
architecture's description was based on the work of J. Presper Eckert and John William Mauchly, inventors of the 
ENIAC at the University of Pennsylvania. [25] 

Von Neumann also created the field of cellular automata without the aid of computers, constructing the first 
self-replicating automata with pencil and graph paper. The concept of a universal constructor was fleshed out in his 
posthumous work Theory of Self Reproducing Automata} 26 ^ Von Neumann proved that the most effective way of 
performing large-scale mining operations such as mining an entire moon or asteroid belt would be by using 
self-replicating machines, taking advantage of their exponential growth. 

He is credited with at least one contribution to the study of algorithms. Donald Knuth cites von Neumann as the 
inventor, in 1945, of the merge sort algorithm, in which the first and second halves of an array are each sorted 
recursively and then merged together. 1 His algorithm for simulating a fair coin with a biased coin 1 is used in the 
"software whitening" stage of some hardware random number generators. 

He also engaged in exploration of problems in numerical hydrodynamics. With R. D. Richtmyer he developed an 
algorithm defining artificial viscosity that improved the understanding of shock waves. It is possible that we would 
not understand much of astrophysics, and might not have highly developed jet and rocket engines without that work. 
The problem was that when computers solve hydrodynamic or aerodynamic problems, they try to put too many 
computational grid points at regions of sharp discontinuity (shock waves). The artificial viscosity was a 
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mathematical trick to slightly smooth the shock transition without sacrificing basic physics. 



Politics and social affairs 

Von Neumann obtained at the age of 29 one of the first five professorships at the new Institute for Advanced Study 
in Princeton, New Jersey (another had gone to Albert Einstein). He was a frequent consultant for the Central 
Intelligence Agency, the United States Army, the RAND Corporation, Standard Oil, IBM, and others. 

Throughout his life von Neumann had a respect and admiration for business and government leaders; something 
which was often at variance with the inclinations of his scientific colleagues. He enjoyed associating with persons in 
positions of power, and this led him into government service. 

As President of the Von Neumann Committee for Missiles, and later as a member of the United States Atomic 
Energy Commission, from 1953 until his death in 1957, he was influential in setting U.S. scientific and military 
policy. Through his committee, he developed various scenarios of nuclear proliferation, the development of 
intercontinental and submarine missiles with atomic warheads, and the controversial strategic equilibrium called 
mutual assured destruction. During a Senate committee hearing he described his political ideology as "violently 
anti-communist, and much more militaristic than the norm". 

Von Neumann's interest in meteorological prediction led him to propose manipulating the environment by spreading 
colorants on the polar ice caps to enhance absorption of solar radiation (by reducing the albedo), thereby raising 
global temperatures. He also favored a preemptive nuclear attack on the Soviet Union, believing that doing so could 
prevent it from obtaining the atomic bomb P 0 ^ 



Personality 

Von Neumann invariably wore a conservative grey flannel business suit - he was even known to play tennis wearing 

his business suit - and he enjoyed throwing large parties at his home in Princeton, occasionally twice a week. His 

T321 

white clapboard house at 26 Westcott Road was one of the largest in Princeton. 1 Despite being a notoriously bad 
driver, he nonetheless enjoyed driving (frequently while reading a book) - occasioning numerous arrests as well as 
accidents. When Cuthbert Hurd hired him as a consultant to IBM, Hurd often quietly paid the fines for his traffic 
tickets/ 33 ^ 

It was said of him at Princeton that, while he was indeed a demigod, he had made a detailed study of humans and 
could imitate them perfectly. t34] 

Von Neumann liked to eat and drink heavily; his wife, Klara, said that he could count everything except calories. He 

ri4i 

enjoyed Yiddish and "off-color" humor (especially limericks). 



Honors 

The John von Neumann Theory Prize of the Institute for Operations Research and the Management Sciences 
(INFORMS, previously TIMS-ORSA) is awarded annually to an individual (or group) who have made fundamental 
and sustained contributions to theory in operations research and the management sciences. 

The IEEE John von Neumann Medal is awarded annually by the IEEE "for outstanding achievements in 
computer-related science and technology." 

The John von Neumann Lecture is given annually at the Society for Industrial and Applied Mathematics (SIAM) by 
a researcher who has contributed to applied mathematics, and the chosen lecturer is also awarded a monetary prize. 

The crater Von Neumann on the Moon is named after him. 

The John von Neumann Computing Center in Princeton, New Jersey (40°20'55"N 74°35'32"W) was named in his 
honour. 
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The professional society of Hungarian computer scientists, John von Neumann Computer Society, is named after 
John von Neumann P 5 ^ 

On February 15, 1956, Neumann was presented with the Presidential Medal of Freedom by President Dwight 
Eisenhower. 

On May 4, 2005 the United States Postal Service issued the American Scientists commemorative postage stamp 
series, a set of four 37-cent self-adhesive stamps in several configurations. The scientists depicted were John von 
Neumann, Barbara McClintock, Josiah Willard Gibbs, and Richard Feynman. 

The John von Neumann Award of The Rajk Laszlo College for Advanced Studies was named in his honour, and has 
been given every year since 1995 to professors who have made an outstanding contribution to the exact social 
sciences and through their work have strongly influenced the professional development and thinking of the members 
of the college. 
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• Budapest Tech Polytechnical Institution - John von Neumann Faculty of Informatics (http://nik.bmf.hu/) 

• John von Neumann speaking at the dedication of the NORD (http://elm.eeng.dcu.ie/~alife/ 
von-neumann-1954-NORD/), December 2, 1954 (audio recording) 

• The American Presidency Project (http://www.presidency.ucsb.edu/ws/index.php ?pid=10735) 

• John Von Neumann Memorial (http://www.findagrave.com/cgi-bin/fg.cgi?page=gr&GRid=7333144) at Find 
A Grave 
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John Hasbrouck Van Vleck 


Born 


March 13, 1899Middletown, Connecticut 


Died 


October 27, 1980Cambridge, Massachusetts 


Nationality 


United States 


Fields 


Physics 


Institutions 


University of Minnesota 
University of Wisconsin-Madison 
Harvard University 
University of Oxford 
Balliol College 


Alma mater 


University of Wisconsin-Madison 
Harvard University 


Doctoral advisor 


Edwin C. Kemble 


Doctoral students 


Robert Serber 
Edward Mills Purcell 
Philip Anderson 


Notable awards 


Elliott Cresson Medal (1971) 
Nobel Prize in Physics (1977) 



John Hasbrouck Van Vleck (March 13, 1899 - October 27, 1980) was an American physicist and mathematician, 
co-awarded the 1977 Nobel Prize in Physics, for his contributions to the understanding of the behavior of electrons 
in magnetic solids. 



Life and work 

Born in Middletown, Connecticut the son of mathematician Edward Burr Van Vleck and grandson of astronomer 
John Monroe Van Vleck, he grew up in Madison, Wisconsin, received A.B. degree from the University of 
Wisconsin-Madison in 1920. Then he went to Harvard for graduate studies and earned a Ph.D degree in 1922. He 
joined the University of Minnesota as an assistant professor in 1923, then moved to the University of 
Wisconsin-Madison before settling at Harvard. He also earned Honorary D. Sc. , or D. Honoris Causa, degree from 
Wesley an University in 1936. ^ 

J. H. van Vleck established the fundamentals of the quantum mechanical theory of magnetism and the crystal field 
theory (chemical bonding in metal complexes). He is regarded as the Father of Modern Magnetism. ^ ^ ^ 

During World War II, J. H. van Vleck worked on radar at the MIT Radiation Lab. He was half time at the Radiation 
Lab and half time on the staff at Harvard. He showed that at about 1.25 -centimeter wavelength water molecules in 
the atmosphere would lead to troublesome absorption and that at 0.5 -centimeter wavelength there would be a similar 
absorption by oxygen molecules. ^ ^ ^ ^ This was to have important consequences not just for military (and 
civil) radar systems but later for the new science of radioastronomy. 

J. H. van Vleck participated in the Manhattan Project. In June 1942, J. Robert Oppenheimer held a summer study for 
confirming the concept and feasibility of nuclear weapon at the University of California, Berkeley. Eight theoretical 
scientists, including J. H. van Vleck, attended it. From July to September, the theoretical study group examined and 
developed the principles of atomic bomb design. ^ tl0] J. H. van Vleck's theoretical work led to establish the Los 
Alamos Nuclear Weapons Laboratory. He also served on the Los Alamos Review committee in 1943. The 
committee, established by General Leslie Groves, also consisted of W.K. Lewis of MIT, Chairman; E.L. Rose, of 
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Jones & Lamson; E.B. Wilson of Harvard; and Richard C. Tolman, Vice Chairman of NDRC. The committee's 
important contribution (originating with Rose) was a reduction in the size of the firing gun for the Little Boy atomic 
bomb, a concept which eliminated additional design- weight and sped up production of the bomb for its eventual 
release over Hiroshima. However it was not employed for the Fat Man bomb at Nagasaki, which relied on implosion 
of a plutonium shell to reach critical mass. ^ ^ 

In the year 1961-62 he was George Eastman Visiting Professor at University of Oxford^ and Professorship of 

Balliol College [15] . He was awarded the National Medal of Science in 1966 ^ and the Lorentz Medal in 1974^ 17 ^ 

For his contributions to the understanding of the behavior of electrons in magnetic solids, van Vleck was awarded 

the Nobel Prize in Physics 1977, along with Philip W. Anderson and Sir Nevill MottJ 18 ^ Van Vleck transformations 

ri9i 

and Van Vleck paramagnetism are also named after him. 
Dr. Van Vleck died in Cambridge, Massachusetts, aged 81. ^ 



Writing 

• The Absorption of Radiation by Multiply Periodic Orbits, and its Relation to the Correspondence Principle and 

Hi] 

the Ray lei gh- Jeans Law. Part I. Some Extensions of the Correspondence Principle , Physical Review, vol. 24, 
Issue 4, pp. 330-346 (1924) 

• The Absorption of Radiation by Multiply Periodic Orbits, and its Relation to the Correspondence Principle and 
the Rayleigh- Jeans Law. Part II. Calculation of Absorption by Multiply Periodic Orbits , Physical Review, 
vol. 24, Issue 4, pp. 347-365 (1924) 

• Quantum Principles and Line Spectra, (Bulletin of the National Research Council; v. 10, pt 4, no. 54, 1926) 

• The Theory of Electric and Magnetic Susceptibilities (Oxford at Clarendon, 1932). 

T231 

• Quantum Mechanics, The Key to Understanding Magnetism , Nobel Lecture, December 8, 1977 



Japanese art collector 

J. H. van Vleck and his wife Abigail were also important art collectors, particularly in the medium of Japanese 
woodblock prints (principally Ukiyo-e), known as Van Vleck Collection. It was inherited from his father Edward 
Burr Van Vleck. They donated it to the Chazen Museum of Art in Madison, Wisconsin in 1980s. ^ 
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Paul Adrien Maurice Dirac 








Born 


Paul Adrien Maurice Dirac8 August 1902Bristol, England 


Died 


20 October 1984 (aged 82)Tallahassee, Florida, USA 


Nationality 


Switzerland (until 1919) 
United Kingdom (from 1919) [1] 


Fields 


Physics 


Institutions 


University of Cambridge 
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Alma mater 
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University of Cambridge 


Doctoral advisor 


Ralph Fowler 


Doctoral students 
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Known for 


Dirac equation 
Dirac comb 
Dirac delta function 
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Dirac large numbers hypothesis 
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Dirac Picture 

Dirac-Coulomb-Breit Equation 
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Nobel Prize in Physics (1933) 
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Paul Dirac 



584 



Notes 

He is the stepfather of Gabriel Andrew Dirac. 

Paul Adrien Maurice Dirac, OM, FRS (pronounced /di'raek/ di-RAK; 8 August 1902 - 20 October 1984) was an 
English theoretical physicist who was one of the founders of quantum mechanics and quantum electrodynamics. 
Dirac made fundamental contributions to the early development of both quantum mechanics and quantum 
electrodynamics. He held the Lucasian Chair of Mathematics at the University of Cambridge and spent the last 
fourteen years of his life at Florida State University. 

Among other discoveries, he formulated the Dirac equation, which describes the behaviour of fermions, and 
predicted the existence of antimatter. 

Dirac shared the Nobel Prize in physics for 1933 with Erwin Schrodinger, "for the discovery of new productive 
forms of atomic theory.' 

Early years 

Paul Dirac was born in Bristol, England and grew up in the Bishopston area of the city. His father, Charles Dirac, 
was an immigrant from Saint-Maurice in the Canton of Valais, Switzerland. His mother was originally from 
Cornwall and the daughter of a mariner. Paul had an elder brother, Felix, who committed suicide in March 1925, and 
a younger sister, Beatrice. His early family life appears to have been unhappy due to his father's unusually strict and 
authoritarian nature. Dirac had a very strained relationship with his father, so much that after his death, he wrote, "I 
feel much freer now, and I am my own man." He was forced to speak only in French at dinner, and was punished for 
any grammatical mistakes. Dirac chose silence over punishment. Dirac, later in life mentioned that it was not until he 
was 23 that he realized that parents were supposed to love their children. His hatred lasted his father's entire life. 

Dirac was educated first at Bishop Road Primary School and then at Merchant Venturers' Technical College (later 
Cotham School), where his father was a French teacher. The school was an institution attached to the University of 
Bristol, which emphasized scientific subjects and modern languages. This was an unusual arrangement at a time 
when secondary education in Britain was still dedicated largely to the classics, and something for which Dirac would 
later express gratitude. 

Dirac studied electrical engineering at the University of Bristol, completing his degree in 1921. He then decided that 
his true calling lay in the mathematical sciences and, after completing a BA in applied mathematics at Bristol in 
1923, he received a grant to conduct research at St John's College, Cambridge, where he would remain for most of 
his career. At Cambridge, Dirac pursued his interests in the theory of general relativity (an interest he gained earlier 
as a student in Bristol) and in the nascent field of quantum physics, under the supervision of Ralph Fowler. 

Career 

Dirac noticed an analogy between the Poisson brackets of classical mechanics and the recently proposed quantization 
rules in Werner Heisenberg's matrix formulation of quantum mechanics. This observation allowed Dirac to obtain 
the quantization rules in a novel and more illuminating manner. For this work, published in 1926, he received a 
Ph.D. from Cambridge. 

In 1928, building on 2x2 spin matrices which he discovered independently (Abraham Pais quoted Dirac as saying "I 

Mi 

believe I got these (matrices) independently of Pauli and possibly Pauli got these independently of me" ) of 
Wolfgang Pauli's work on non-relativistic spin systems, he proposed the Dirac equation as a relativistic equation of 
motion for the wavefunction of the electron This work led Dirac to predict the existence of the positron, the 
electron's antiparticle, which he interpreted in terms of what came to be called the Dirac sea. The positron was 
observed by Carl Anderson in 1932. Dirac's equation also contributed to explaining the origin of quantum spin as a 
relativistic phenomenon. 
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The necessity of fermions i.e. matter being created and destroyed in Enrico Fermi's 1934 theory of beta decay, 
however, led to a reinterpretation of Dirac's equation as a "classical" field equation for any point particle of spin h/2, 
itself subject to quantization conditions involving anti-commutators. Thus reinterpreted as a (quantum) field equation 
accurately describing quarks and leptons i.e. all elementary matter particles, this Dirac field equation is as central to 
theoretical physics as the Maxwell, Yang-Mills and Einstein field equations. Dirac is regarded as the founder of 
quantum electrodynamics, being the first to use that term. He also introduced the idea of vacuum polarization in the 
early 1930s. This work was key to the development of quantum mechanics by the next generation of theorists, and in 
particular Schwinger, Feynman, Sin-Itiro Tomonaga and Dyson in their formulation of quantum electrodynamics. 

Dirac's Principles of Quantum Mechanics, published in 1930, is a landmark in the history of science. It quickly 
became one of the standard textbooks on the subject and is still used today. In that book, Dirac incorporated the 
previous work of Werner Heisenberg on matrix mechanics and of Erwin Schrodinger on wave mechanics into a 
single mathematical formalism that associates measurable quantities to operators acting on the Hilbert space of 
vectors that describe the state of a physical system. The book also introduced the delta function. Following his 1939 

T71 T81 

article, he also included the bra-ket notation in the third edition of his book, thereby contributing to its universal 
use nowadays. 

In 1933, following his 1931 paper on magnetic monopoles, Dirac showed that the existence of a single magnetic 

rm 

monopole in the universe would suffice to explain the observed quantization of electrical charge. In 1975, 
1982/ 10] and 2009 ^ 1] tl2] tl3] intriguing results suggested the possible detection of magnetic monopoles, but there is, 
to date, no direct evidence for their existence. 

Dirac was the Lucasian Professor of Mathematics at Cambridge from 1932 to 1969. In 1937, he proposed a 
speculative cosmological model based on the so-called large numbers hypothesis. During World War II, he 
conducted important theoretical and experimental research on uranium enrichment by gas centrifuge. 

Dirac's quantum electrodynamics made predictions that were - more often than not - infinite and therefore 
unacceptable. A workaround known as renormalization was developed, but Dirac never accepted this. "I must say 
that I am very dissatisfied with the situation," he said in 1975, "because this so-called 'good theory' does involve 
neglecting infinities which appear in its equations, neglecting them in an arbitrary way. This is just not sensible 
mathematics. Sensible mathematics involves neglecting a quantity when it is small — not neglecting it just because 
it is infinitely great and you do not want it!"'- 14 -' His refusal to accept renormalization, resulted in his work on the 
subject moving increasingly out of the mainstream. However, from his once rejected notes he managed to work on 
putting QED on "logical foundations" based on Hamiltonian formalism that he formulated. He found a rather novel 
way of deriving the anomalous magnetic moment "Schwinger term" and also the Lamb shift, afresh, using the 
Heisenberg picture and without using the joining method used by Weisskopf and French, the two pioneers of modern 
QED, Schwinger and Feynman, in 1963. That was two years before the Tomonaga-Schwinger-Feynman QED was 
given formal recognition by an award of the Nobel Prize for physics. Weisskopf and French (FW) were the first to 
obtain the correct result for the Lamb shift and the anomalous magnetic moment of the electron. At first FW results 
did not agree with the incorrect but independent results of Feynman and Schwinger (Schweber SS 1994 "QED and 
the men who made it: Dyson,Feynman,Schwinger and Tomonaga", Princeton :PUP). The 1963-1964 lectures Dirac 
gave on quantum field theory at Yeshiva University were published in 1966 as the Belfer Graduate School of 
Science, Monograph Series Number, 3. After having relocated to Florida in order to be near his elder daughter, 
Mary, Dirac spent his last fourteen years (of both life and physics research) at the University of Miami in Coral 
Gables, Florida and Florida State University in Tallahassee, Florida. 

In the 1950s in his search for a better QED, Paul Dirac developed the Hamiltonian theory of constraints (Canad J 
Math 1950 vol 2, 129; 1951 vol 3, 1) based on lectures that he delivered at the 1949 International Mathematical 
Congress in Canada. Dirac (1951 "The Hamiltonian Form of Field Dynamics" Canad Jour Math, vol 3,1) had also 
solved the problem of putting the Tomonaga-Schwinger equation into the Schrodinger representation (See Phillips R 
J N 1987 "Tributes to Dirac" p31 London: Adam Hilger) and given explicit expressions for the scalar meson field 
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(spin zero pion or pseudoscalar meson), the vector meson field (spin one rho meson), and the electromagnetic field 
(spin one massless boson, photon). 

The Hamiltonian of constrained systems is one of Dirac's many masterpieces. It is a powerful generalization of 
Hamiltonian theory that remains valid for curved spacetime. The equations for the Hamiltonian involve only six 
degrees of freedom described by 9rs , p rs for each point of the surface on which the state is considered. The <?mO 
(m = 0,1,2,3) appear in the theory only through the variables g r0 , (_£ 00 ) -1 / 2 which occur as arbitrary 
coefficients in the equations of motion. H=J ^ 3 x[ (_ <? 00 ) -1 / 2 - g r0 f g 00 H r ] There are four constraints 
or weak equations for each point of the surface x ®= constant. Three of them inform the four vector density in the 

surface. The fourth is a 3-dimensional scalar density in the surface iJ L «0; H r ~0 (r=l,2,3) 

In the late 1950s he applied the Hamiltonian methods he had developed to cast Einstein's general relativity in 

Hamiltonian form (Proc Roy Soc 1958,A vol 246, 333,Phys Rev 1959,vol 114, 924) and to bring to a technical 

completion the quantization problem of gravitation and bring it also closer to the rest of physics according to Salam 

and DeWitt. In 1959 also he gave an invited talk on "Energy of the Gravitational Field" at the New York Meeting of 

the American Physical Society later published in 1959 Phys Rev Lett 2, 368. In 1964 he published his "Lectures on 

Quantum Mechanics" (London: Academic) which deals with constrained dynamics of nonlinear dynamical systems 

including quantization of curved spacetime. He also published a paper entitled "Quantization of the Gravitational 

Field" in 1967 ICTP/IAEA Trieste Symposium on Contemporary Physics. 

If one considers waves moving in the direction ^resolved into the corresponding Fourier components (r,s = 1,2,3), 
the variables in the degrees of freedom 13,23,33 are affected by the changes in the coordinate system whereas those 
in the degrees of freedom 12, (11-22) remain invariant under such changes. The expression for the energy splits up 
into terms each associated with one of these six degrees of freedom without any cross terms associated with two of 
them. The degrees of freedom 13, 23, 33 do not appear at all in the expression for energy of gravitational waves in 
the direction The two degrees of freedom 12, (11-22) contribute a positive definite amount of such a form to 
represent the energy of gravitational waves. These two degrees of freedom correspond in the language of quantum 
theory , to the gravitational photons (gravitons) with spin +2 or -2 in their direction of motion. The degrees of 
freedom (11+22) gives rise to the Newtonian potential energy term showing the gravitational force between the two 
positive mass is attractive and the self energy of every mass is negative. 

Amongst his many students was John Polkinghorne, who recalls that Dirac "was once asked what was his 
fundamental belief. He strode to a blackboard and wrote that the laws of nature should be expressed in beautiful 

n[15] 

equations. 



Personal life 

Dirac married Eugene Wigner's sister, Margit, in 1937. He adopted Margit's two children, Judith and Gabriel. Paul 
and Margit Dirac had two children together, both daughters, Mary Elizabeth and Florence Monica. 

Margit, known as Manci, visited her brother in 1934 in Princeton from her native Hungary and, while at dinner at the 

Annex Restaurant (1930s-2006^ ), met the "lonely-looking man at the next table." This account came from a 

physicist from Korea who met and was influenced by Dirac, Y.S. Kim, who has also written: "It is quite fortunate for 

the physics community that Manci took good care of our respected Paul A.M. Dirac. Dirac published eleven papers 

during the period 1939-46.... Dirac was able to maintain his normal research productivity only because Manci was in 
ri7i 

charge of everything else." 1 

A reviewer of the 2009 biography writes: "Dirac blamed his [emotional] frailties on his father, a Swiss immigrant 
who bullied his wife, chivvied his children and insisted Paul spoke only French at home, even though the Diracs 
lived in Bristol. 'I never knew love or affection when I was a child,' Dirac once said." She also writes that "[t]he 
problem lay with his genes. Both father and son had autism, to differing degrees. Hence the Nobel winner's 
reticence, literal-mindedness, rigid patterns of behaviour and self-centredness. [Quoting the biography:] 'Dirac's 
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traits as a person with autism were crucial to his success as a theoretical physicist: his ability to order information 
about mathematics and physics in a systematic way, his visual imagination, his self-centredness, his concentration 

MOT 

and determination. ' " 



Personality 

Dirac was known among his colleagues for his precise and taciturn nature. His colleagues in Cambridge jokingly 

ri9i 

defined a unit of a dirac which was one word per hour. 1 When Niels Bohr complained that he did not know how to 
finish a sentence in a scientific article he was writing, Dirac replied, "I was taught at school never to start a sentence 
without knowing the end of it."'- 20 -' He criticized the physicist J. Robert Oppenheimer's interest in poetry: "The aim of 
science is to make difficult things understandable in a simpler way; the aim of poetry is to state simple things in an 
incomprehensible way. The two are incompatible." 1 J 

Dirac himself wrote in his diary during his postgraduate years that he concentrated solely on his research, and only 
stopped on Sunday, when he took long strolls alone. 

An anecdote recounted in a review of the 2009 biography tells of Werner Heisenberg and Dirac sailing on a cruise 

ship to a conference in Japan in August 1929. "Both still in their twenties, and unmarried, they made an odd couple. 

Heisenberg was a ladies' man who constantly flirted and danced, while Dirac — 'an Edwardian geek', as [biographer] 

Graham Farmelo puts it — suffered agonies if forced into any kind of socialising or small talk. 'Why do you dance?' 

Dirac asked his companion. 'When there are nice girls, it is a pleasure,' Heisenberg replied. Dirac pondered this 

n 8i 

notion, then blurted out: 'But, Heisenberg, how do you know beforehand that the girls are nice?" 

According to a story told in different versions, a friend or student visited Dirac, not knowing of his marriage. 

Noticing the visitor's surprise at seeing an attractive woman in the house, Dirac said, "This is... this is Wigner's 

sister". Margit Dirac told both George Gamow and Anton Capri in the 1960s that her husband had actually said, 

P221 [231 

"Allow me to present Wigner's sister, who is now my wife." 

Dirac was also noted for his personal modesty. He called the equation for the time evolution of a 
quantum-mechanical operator, which he was the first to write down, the "Heisenberg equation of motion". Most 
physicists speak of Fermi-Dirac statistics for half-integer-spin particles and Bose-Einstein statistics for integer-spin 
particles. While lecturing later in life, Dirac always insisted on calling the former "Fermi statistics". He referred to 
the latter as "Einstein statistics" for reasons, he explained, of "symmetry". 



Religious views 

Heisenberg recollects a friendly conversation among young participants at the 1927 Solvay Conference about 
Einstein and Planck's views on religion. Wolfgang Pauli, Heisenberg and Dirac took part in it. Dirac's contribution 
was a criticism of the political purpose of religion, which was much appreciated for its lucidity by Bohr when 
Heisenberg reported it to him later. Among other things, Dirac said:'- 24 -' 

I cannot understand why we idle discussing religion. If we are honest — and scientists have to be — we must admit that religion is a jumble of 
false assertions, with no basis in reality. The very idea of God is a product of the human imagination. It is quite understandable why primitive 
people, who were so much more exposed to the overpowering forces of nature than we are today, should have personified these forces in fear 
and trembling. But nowadays, when we understand so many natural processes, we have no need for such solutions. I can't for the life of me see 
how the postulate of an Almighty God helps us in any way. What I do see is that this assumption leads to such unproductive questions as why 
God allows so much misery and injustice, the exploitation of the poor by the rich and all the other horrors He might have prevented. If religion 
is still being taught, it is by no means because its ideas still convince us, but simply because some of us want to keep the lower classes quiet. 
Quiet people are much easier to govern than clamorous and dissatisfied ones. They are also much easier to exploit. Religion is a kind of opium 
that allows a nation to lull itself into wishful dreams and so forget the injustices that are being perpetrated against the people. Hence the close 
alliance between those two great political forces, the State and the Church. Both need the illusion that a kindly God rewards — in heaven if not 
on earth — all those who have not risen up against injustice, who have done their duty quietly and uncomplainingly. That is precisely why the 
honest assertion that God is a mere product of the human imagination is branded as the worst of all mortal sins. 
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Heisenberg's view was tolerant. Pauli, raised as a Catholic had kept silent after some initial remarks, but when finally 
he was asked for his opinion, jokingly he said: "Well, I'd say that also our friend Dirac has got a religion and the first 
commandment of this religion is 'There is no God and Paul Dirac is his prophet."' Everybody burst into laughter, 
including Dirac P 4 ^ 

Death and after 

In 1984, Dirac died in Tallahassee, Florida where he is buried. 

The Dirac-Hellmann Award at FSU was endowed by Dr Bruce P. Hellmann (Dirac's last doctoral student) in 1997 to 
reward outstanding work in theoretical physics by FSU researchers. The Paul A.M. Dirac Science Library at FSU is 
named in his honour. 

In 1995, a plaque in his honour bearing the Dirac equation was unveiled at Westminster Abbey in London with a 
speech from Stephen Hawking. 

His childhood home in Bristol is commemorated with a blue plaque and the nearby Dirac Road is named in 
recognition of his links with the city. 

A commemorative garden has been established opposite the railway station in Saint-Maurice, Switzerland, the town 
of origin of his father's family. 

Honours 

Dirac shared the 1933 Nobel Prize for physics with Erwin Schrodinger "for the discovery of new productive forms of 
atomic theory.' Dirac was also awarded the Royal Medal in 1939 and both the Copley Medal and the Max Planck 
medal in 1952. He was elected a Fellow of the Royal Society in 1930, an Honorary Fellow of the American Physical 
Society in 1948, and an Honorary Fellow of the Institute of Physics, London in 1971. Dirac was awarded the Order 
of Merit, an outstanding recognition by the land of his birth, in 1973. 

In 1975, Dirac gave a series of five lectures at the University of New South Wales which were subsequently 
published as a book, Directions of Physics (Wiley, 1978 - H. Hora and J. Shepanski, eds.). Dirac donated the 
royalties from this book to the University for the establishment of the Dirac Lecture Series. The Silver Dirac 
Medal for the Advancement of Theoretical Physics is awarded by the University of New South Wales on the 
occasion of the Public Dirac Lecture. 

Immediately after his death, two organizations of professional physicists established annual awards in Dirac's 

memory. The Institute of Physics, the United Kingdom's professional body for physicists, awards the Paul Dirac 

Medal and Prize for "outstanding contributions to theoretical (including mathematical and computational) 
[27] 

physics". The first three recipients were Stephen Hawking (1987), John Stewart Bell (1988), and Roger Penrose 
(1989). The Abdus Salam International Centre for Theoretical Physics (ICTP) awards the Dirac Medal of the ICTP 
each year on Dirac's birthday (8 August). Also, the Dirac Prize is awarded by the International Centre for Theoretical 
Physics in his memory. Dirac House in Bristol is the headquarters of Institute of Physics Publishing. 

The street on which the National High Magnetic Field Laboratory in Tallahassee, Florida, is located was named Paul 
Dirac Drive. A second location on the Florida State University campus, the Paul A. M. Dirac Science Library, also 
bears his name. There is also a road named after him in his home town of Bristol, UK. The BBC named its video 
codec Dirac in his honour. 
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Legacy 

Dirac is widely regarded as one of the world's greatest physicists. He was one of the founders of quantum mechanics 
and quantum electrodynamics. 

His early contributions include the modern operator calculus for quantum mechanics, which he called transformation 
theory, and an early version of the path integral He formulated a many-body formalism for quantum mechanics 
which allowed each particle to have its own proper time. 

His relativistic wave equation for the electron was the first successful attack on the problem of relativistic quantum 
mechanics. Dirac founded quantum field theory with his reinterpretation of the Dirac equation as a many-body 
equation, which predicted the existence of antimatter and matter-antimatter annihilation. He was the first to 
formulate quantum electrodynamics, although he could not calculate arbitrary quantities because the short distance 
limit requires renormalization. 

In an attempt to solve the quantum divergence problem, Dirac gave a classical point particle theory combining 
advanced and retarded waves to eliminate the classical electron self-energy. Although these classical methods did 
not immediately solve the problems in quantum electrodynamics, they did lead John Archibald Wheeler and Richard 
Feynman to formulate an alternative Green's function description for light, which eventually led to Feynman's point 
particle formulation of quantum field theory. 

Dirac discovered the magnetic monopole solutions, the first topological configuration in physics, and used them to 
give the modern explanation of charge quantization. He developed constrained quantization in the 1960s, identifying 
the general quantum rules for arbitrary classical systems. 

Dirac's quantum- field analysis of the vibrations of a membrane, in the early 1960s, proved extremely useful to 
modern practitioners of superstring theory and its closely related successor, M-Theory. L 
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Werner Heisenberg (5 December 1901 - 1 February 1976) was a German theoretical physicist who made 
foundational contributions to quantum mechanics and is best known for asserting the uncertainty principle of 
quantum theory. In addition, he made important contributions to nuclear physics, quantum field theory, and particle 
physics. 

Heisenberg, along with Max Born and Pascual Jordan, set forth the matrix formulation of quantum mechanics in 
1925. Heisenberg was awarded the 1932 Nobel Prize in Physics for the creation of quantum mechanics, and its 
application especially to the discovery of the allotropic forms of hydrogen 

Following World War II, he was appointed director of the Kaiser Wilhelm Institute for Physics, which was soon 
thereafter renamed the Max Planck Institute for Physics. He was director of the institute until it was moved to 
Munich in 1958, when it was expanded and renamed the Max Planck Institute for Physics and Astrophysics. 

Heisenberg was also president of the German Research Council, chairman of the Commission for Atomic Physics, 
chairman of the Nuclear Physics Working Group, and president of the Alexander von Humboldt Foundation. 



Biography 

Early years 

Heisenberg was born in Wurzburg, Germany to Kaspar Earnesta August Heisenberg, a secondary school teacher of 
classical languages who became Germany's only ordentlicher Professor (ordinarius professor) of medieval and 
modern Greek studies in the university system and his wife Annie Wecklein. 

He studied physics and mathematics from 1920 to 1923 at the Ludwig-Maximilians-Universitat Munchen and the 
Georg-August-Universitdt Gottingen. At Munich, he studied under Arnold Sommerfeld and Wilhelm Wien. At 
Gottingen, he studied physics with Max Born and James Franck, and he studied mathematics with David Hilbert. He 
received his doctorate in 1923, at Munich under Sommerfeld. He completed his Habilitation in 1924, at Gottingen 
under Born.^ ^ 

Because Sommerfeld had a sincere interest in his students and knew of Heisenberg's interest in Niels Bohr's theories 
on atomic physics, Sommerfeld took Heisenberg to Gottingen to the Bohr-Festspiele (Bohr Festival) in June 1922. 
At the event, Bohr was a guest lecturer and gave a series of comprehensive lectures on quantum atomic physics. 
There, Heisenberg met Bohr for the first time, and it had a significant and continuing effect on him.^ ^ ^ 

Heisenberg's doctoral thesis, the topic of which was suggested by Sommerfeld, was on turbulence; the thesis 
discussed both the stability of laminar flow and the nature of turbulent flow. The problem of stability was 
investigated by the use of the Orr-Sommerfeld equation, a fourth order linear differential equation for small 
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disturbances from laminar flow. He would briefly return to this topic after World War II. 

Heisenberg's paper on the anomalous Zeeman effect^ was accepted as his Habilitationsschrift under Max Born at 
Gottingen. 11 1] 

In his youth he was a member and Scoutleader of the Neupfadfinder, a German Scout association and part of the 
German Youth Movement/ 12 ^ ^ ^ In August 1923 Robert Honsell and Heisenberg organized a trip (Grofifahrt) to 
Finland with a Scout group of this association from Munich. tl5] 

Career 

Gottingen, Copenhagen, and Leipzig 

From 1924 to 1927, Heisenberg was a Privatdozent at Gottingen. From 17 September 1924 to 1 May 1925, under an 
International Education Board Rockefeller Foundation fellowship, Heisenberg went to do research with Niels Bohr, 
director of the Institute of Theoretical Physics at the University of Copenhagen. He returned to Gottingen and with 
Max Born and Pascual Jordan, over a period of about six months, developed the matrix mechanics formulation of 
quantum mechanics. On 1 May 1926, Heisenberg began his appointment as a university lecturer and assistant to 
Bohr in Copenhagen. It was in Copenhagen, in 1927, that Heisenberg developed his uncertainty principle, while 
working on the mathematical foundations of quantum mechanics. In his paper ^ on the uncertainty principle, 
Heisenberg used the word " Ungenauigkeit" (imprecision) ^ ^ 

In 1927, Heisenberg was appointed ordentlicher Professor (ordinarius professor) of theoretical physics and head of 

the department of physics at the Universitat Leipzig; he gave his inaugural lecture on 1 February 1928. In his first 

ri9i 

paper published from Leipzig, Heisenberg used the Pauli exclusion principle to solve the mystery of 
fen-omagnetism. [3][4][17][20] ' 

In Heisenberg's tenure at Leipzig, the quality of doctoral students, post-graduate and research associates who studied 
and worked with Heisenberg there is attested to by the acclaim later earned by these people; at various times, they 
included: Erich Bagge, Felix Bloch, Ugo Fano, Siegfried Fliigge, William Vermillion Houston, Friedrich Hund, 
Robert S. Mulliken, Rudolf Peierls, George Placzek, Isidor Isaac Rabi, Fritz Sauter, John C. Slater, Edward Teller, 
John Hasbrouck van Vleck, Victor Frederick Weisskopf, Carl Friedrich von Weizsacker, Gregor Wentzel and 
Clarence ZenerJ 21 ^ 

[22] [231 

In early 1929, Heisenberg and Wolfgang Pauli submitted the first of two papers laying the foundation for 

relativistic quantum field theory. Also in 1929, Heisenberg went on a lecture tour in the United States, Japan, China, 
andlndia. [17][21] 



Shortly after the discovery of the neutron by James Chadwick in 1932, Heisenberg submitted the first of three 
pap( 

[27] 



papers'^ t25] t26] on his neutron-proton model of the nucleus. He was awarded the 1932 Nobel Prize in Physics. tlT| 



In 1928, the British mathematical physicist P. A. M. Dirac had derived the relativistic wave equation of quantum 
mechanics, which implied the existence of positive electrons, later to be named positrons. In 1932, from a cloud 
chamber photograph of cosmic rays, the American physicist Carl David Anderson identified a track as having been 
made by a positron. In mid-1933, Heisenberg presented his theory of the positron. His thinking on Dirac's theory and 
further development of the theory were set forth in two papers. The first, Bemerkungen zur Dirac schen Theorie des 

T281 

Positrons (Remarks on Dirac's theory of the positron) was published in 1934, and the second, Folgerungen aus 

ri7i 

der Diracschen Theorie des Positrons (Consequences of Dirac's Theory of the Positron), was published in 1936. 
[29] [30] 

In the early 1930s in Germany, the deutsche Physik movement was anti-Semitic and anti-theoretical physics, 

especially including quantum mechanics and the theory of relativity. As applied in the university environment, 

[311 

political factors took priority over the historically applied concept of scholarly ability, even though its two most 

[321 T331 
prominent supporters were the Nobel Laureates in Physics Philipp Lenard and Johannes Stark. 
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After Adolf Hitler came to power in 1933, Heisenberg was attacked in the press as a "White Jew' by elements of 

the deutsche Physik (German Physics) movement for his insistence on teaching about the roles of Jewish scientists. 

As a result, he came under investigation by the SS. This was over an attempt to appoint Heisenberg as successor to 

Arnold Sommerfeld at the University of Munich. The issue was resolved in 1938 by Heinrich Himmler, head of the 

SS. While Heisenberg was not chosen as Sommerf eld's successor, he was rehabilitated to the physics community 

during the Third Reich. Nevertheless, supporters of deutsche Physik launched vicious attacks against leading 

theoretical physicists, including Arnold Sommerfeld and Heisenberg. On 29 June 1936, a National Socialist Party 

newspaper published a column attacking Heisenberg. On 15 July 1937, he was attacked in a journal of the SS. This 

ri7i 

was the beginning of what is called the Heisenberg Affair. 

In mid- 1936, Heisenberg presented his theory of cosmic-ray showers in two papers ^ Four more papers^ 37 ^ ^ 
[39] [40] a pp eare( j m me next two years. tl7] t41] 

In June 1939, Heisenberg bought a summer home for his family in Urfeld, in southern Germany. He also traveled to 
the United States in June and July, visiting Samuel Abraham Goudsmit, at the University of Michigan in Ann Arbor. 
However, Heisenberg refused an invitation to emigrate to the United States. He would not see Goudsmit again until 
six years later, when Goudsmit was the chief scientific advisor to the American Operation Alsos at the close of 
World War II. Ironically, Heisenberg would be arrested under Operation Alsos and detained in England under 

r\ tj -i [17] [42] [43] 

Operation Epsilon. 



Matrix Mechanics and the Nobel Prize 



Heisenberg' s paper establishing quantum mechanics ^ has puzzled 

physicists and historians. His methods assume that the reader is 

familiar with Kramers-Heisenberg transition probability calculations. 

The main new idea, noncommuting matrices, is justified only by a 

rejection of unobservable quantities. It introduces the non-commutative 

multiplication of matrices by physical reasoning, based on the 

correspondence principle, despite the fact that Heisenberg was not then 

familiar with the mathematical theory of matrices. The path leading to 

[45] 

these results has been reconstructed in MacKinnon, 1977, and the 




detailed calculations are worked out in Aitchison et al. 



[46] 



Niels Bohr, Werner Heisenberg, and Wolfgang 
Pauli, ca. 1935 



In Copenhagen, Heisenberg and H. Kramers collaborated on a paper on 
dispersion, or the scattering from atoms of radiation whose wavelength is larger than the atoms. They showed that 
the successful formula Kramers had developed earlier could not be based on Bohr orbits, because the transition 
frequencies are based on level spacings which are not constant. The frequencies which occur in the Fourier transform 
of sharp classical orbits, by contrast, are equally spaced. But these results could be explained by a semi-classical 
Virtual State model: the incoming radiation excites the valence, or outer, electron to a virtual state from which it 
decays. In a subsequent paper Heisenberg showed that this virtual oscillator model could also explain the 
polarization of fluorescent radiation. 

These two successes, and the continuing failure of the Bohr-Sommerfeld model to explain the outstanding problem 
of the anomalous Zeeman effect, led Heisenberg to use the virtual oscillator model to try to calculate spectral 
frequencies. The method proved too difficult to immediately apply to realistic problems, so Heisenberg turned to a 
simpler example, the anharmonic oscillator. 

The dipole oscillator consists of a simple harmonic oscillator, which is thought of as a charged particle on a spring, 
perturbed by an external force, like an external charge. The motion of the oscillating charge can be expressed as a 
Fourier series in the frequency of the oscillator. Heisenberg solved for the quantum behavior by two different 
methods. First, he treated the system with the virtual oscillator method, calculating the transitions between the levels 
that would be produced by the external source. 
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He then solved the same problem by treating the anharmonic potential term as a perturbation to the harmonic 
oscillator and using the perturbation methods that he and Born had developed. Both methods led to the same results 
for the first and the very complicated second order correction terms. This suggested that behind the very complicated 
calculations lay a consistent scheme. 

So Heisenberg set out to formulate these results without any explicit dependence on the virtual oscillator model. To 
do this, he replaced the Fourier expansions for the spatial coordinates by matrices, matrices which corresponded to 
the transition coefficients in the virtual oscillator method. He justified this replacement by an appeal to Bohr's 
correspondence principle and the Pauli doctrine that quantum mechanics must be limited to observables. 

On 9 July, Heisenberg gave Born this paper to review and submit for publication. When Born read the paper, he 
recognized the formulation as one which could be transcribed and extended to the systematic language of 
matrices, ^ which he had learned from his study under Jakob Rosanes^ at Breslau University. Born, with the help 
of his assistant and former student Pascual Jordan, began immediately to make the transcription and extension, and 
they submitted their results for publication; the paper was received for publication just 60 days after Heisenberg' s 
paper J 49 ^ A follow-on paper was submitted for publication before the end of the year by all three authors P 0 ^ (A 
brief review of Born's role in the development of the matrix mechanics formulation of quantum mechanics along 
with a discussion of the key formula involving the non-commutivity of the probability amplitudes can be found in an 
article by Jeremy Bernstein, Max Born and the Quantum Theory P 1 ^ A detailed historical and technical account can 
be found in Mehra and Rechenberg's book The Historical Development of Quantum Theory. Volume 3. The 
Formulation of Matrix Mechanics and Its Modifications 1925—1926. ) 

Up until this time, matrices were seldom used by physicists; they were considered to belong to the realm of pure 
mathematics. Gustav Mie had used them in a paper on electrodynamics in 1912 and Born had used them in his work 
on the lattices theory of crystals in 1921. While matrices were used in these cases, the algebra of matrices with their 
multiplication did not enter the picture as they did in the matrix formulation of quantum mechanics. 

Born had learned matrix algebra from Rosanes, as already noted, but Born had also learned Hilbert's theory of 
integral equations and quadratic forms for an infinite number of variables as was apparent from a citation by Born of 
Hilbert's work Grundziige einer allgemeinen Theorie der Linearen Integralgleichungen published in 1912.^ ^ 
Jordan, too was well equipped for the task. For a number of years, he had been an assistant to Richard Courant at 
Gottingen in the preparation of Courant and David Hilbert's book Methoden der mathematischen Physik I, which was 
published in 1924.^ This book, fortuitously, contained a great many of the mathematical tools necessary for the 
continued development of quantum mechanics. In 1926, John von Neumann became assistant to David Hilbert, and 
he would coin the term Hilbert space to describe the algebra and analysis which were used in the development of 
quantum mechanics. t57] t58] 

In 1928, Albert Einstein nominated Heisenberg, Born, and Jordan for the Nobel Prize in Physics, but it was not to 
be. The announcement of the Nobel Prize in Physics for 1932 was delayed until November 1933.^ It was at that 
time that it was announced Heisenberg had won the Prize for 1932 "for the creation of quantum mechanics, the 
application of which has, inter alia, led to the discovery of the allotropic forms of hydrogen" ^ ^ and Erwin 
Schrodinger and Paul Adrien Maurice Dirac shared the 1933 Prize "for the discovery of new productive forms of 
atomic theory"/ 62 ^ One can rightly ask why Born was not awarded the Prize in 1932 along with Heisenberg - 
Bernstein gives some speculations on this matter. One of them is related to Jordan joining the Nazi Party on 1 May 
1933 and becoming a Storm Trooper P 3 ^ Hence, Jordan's Party affiliations and Jordan's links to Born may have 
affected Born's chance at the Prize at that time. Bernstein also notes that when Born won the Prize in 1954, Jordan 
was still alive, and the Prize was awarded for the statistical interpretation of quantum mechanics, attributable alone 
to Born. [64] 

Heisenberg's reaction to Born for Heisenberg receiving the Prize for 1932 and to Born for Born receiving the Prize in 
1954 are also instructive in evaluating whether Born should have shared the Prize with Heisenberg. On 25 November 
1933, Born received a letter from Heisenberg in which he said he had been delayed in writing due to a "bad 
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conscience" that he alone had received the Prize "for work done in Gottingen in collaboration - you, Jordan and I." 
Heisenberg went on to say that Born and Jordan's contribution to quantum mechanics cannot be changed by "a 
wrong decision from the outside. "^ In 1954, Heisenberg wrote an article honoring Max Planck for his insight in 
1900. In the article, Heisenberg credited Born and Jordan for the final mathematical formulation of matrix mechanics 
and Heisenberg went on to stress how great their contributions were to quantum mechanics, which were not 
"adequately acknowledged in the public eye."^ 

The deutsche Physik movement 

On 1 April 1935, the eminent theoretical physicist Arnold Sommerfeld, Heisenberg's doctoral advisor at the 
University of Munich, achieved emeritus status. However, Sommerfeld stayed in his chair during the selection 
process for his successor, which took until 1 December 1939. The process was lengthy due to academic and political 
differences between the Munich Faculty's selection and that of the Reichserziehungsministerium (REM, Reich 
Education Ministry.) and the supporters of Deutsche Physik, which was anti-Semitic and had a bias against 
theoretical physics, especially quantum mechanics and the theory of relativity. In 1935, the Munich Faculty drew up 
a list of candidates to replace Sommerfeld as ordinarius professor of theoretical physics and head of the Institute for 
Theoretical Physics at the University of Munich. There were three names on the list: Werner Heisenberg, who 
received the Nobel Prize in Physics for 1932, Peter Debye, who would receive the Nobel Prize in Chemistry in 1936, 
and Richard Becker - all former students of Sommerfeld. The Munich Faculty was firmly behind these candidates, 
with Heisenberg as their first choice. However, supporters of Deutsche Physik and elements in the REM had their 
own list of candidates and the battle dragged on for over four years. During this time, Heisenberg came under vicious 
attack by the Deutsche Physik supporters. One attack was published in Das Schwarze Korps, the newspaper of the 
Schutzstaffel (SS), headed by Heinrich Himmler. In this, Heisenberg was called a "White Jew" (i.e. an Aryan who 
acts like a Jew) who should be made to "disappear. These attacks were taken seriously, as Jews were violently 
attacked and incarcerated. Heisenberg fought back with an editorial and a letter to Himmler, in an attempt to resolve 
this matter and regain his honour. At one point, Heisenberg's mother visited Himmler's mother. The two women 
knew each other as Heisenberg's maternal grandfather and Himmler's father were rectors and members of a Bavarian 
hiking club. Eventually, Himmler settled the Heisenberg affair by sending two letters, one to SS Gruppenfiihrer 
Reinhard Heydrich and one to Heisenberg, both on 21 July 1938. In the letter to Heydrich, Himmler said Germany 
could not afford to lose or silence Heisenberg as he would be useful for teaching a generation of scientists. To 
Heisenberg, Himmler said the letter came on recommendation of his family and he cautioned Heisenberg to make a 
distinction between professional physics research results and the personal and political attitudes of the involved 
scientists. The letter to Heisenberg was signed under the closing "Mit freundlichem Gruss und, Heil Hitler!" (With 
friendly greetings, Heil Hitler!")^ Overall, the Heisenberg affair was a victory for academic standards and 
professionalism. However, the appointment of Wilhelm Muller to replace Sommerfeld was a political victory over 
academic standards. Muller was not a theoretical physicist, had not published in a physics journal, and was not a 
member of the Deutsche Physikalische Gesellschaft; his appointment was considered a travesty and detrimental to 
educating theoretical physicists J 68 ^ ^ ^ ^ ^ 

During the SS investigation of Heisenberg, the three investigators had training in physics. Heisenberg had 
participated in the doctoral examination of one of them at the Universitat Leipzig. The most influential of the three, 
however, was Johannes Juilfs. During their investigation, they had become supporters of Heisenberg as well as his 
position against the ideological policies of the deutsche Physik movement in theoretical physics and academia. 
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World War II 

In 1939, shortly after the discovery of nuclear fission, the German nuclear energy project, also known as the 
Uranverein (Uranium Club), was begun. Heisenberg was one of the principal scientists leading research and 
development in the project. 

From 15 to 22 September 1941, Heisenberg traveled to German-occupied Copenhagen to lecture and discuss nuclear 
research and theoretical physics with Niels Bohr. The meeting, and specifically what it might reveal about 
Heisenberg's intentions concerning developing nuclear weapons for the Nazi regime, is the subject of the award 
winning Michael Frayn play titled Copenhagen. Documents relating to the Bohr-Heisenberg meeting were released 
in 2002 by the Niels Bohr Archive and by the Heisenberg family ^ 

On 26 February 1942, Heisenberg presented a lecture to Reich officials on energy acquisition from nuclear fission, 
after the Army withdrew most of its funding. t76] The Uranium Club was transferred to the Reich Research Council 
(RFR) in July 1942. On 4 June 1942, Heisenberg was summoned to report to Albert Speer, Germany's Minister of 
Armaments, on the prospects for converting the Uranium Club's research toward developing nuclear weapons. 
During the meeting, Heisenberg told Speer that a bomb could not be built before 1945, and would require significant 
monetary and manpower resources. Five days later, on 9 June 1942, Adolf Hitler issued a decree for the 
reorganization of the RFR as a separate legal entity under the Reich Ministry for Armament and Ammunition; the 
decree appointed Reich Marshall Goring as the president. ^ 

In September 1942, Heisenberg submitted his first paper of a three-part series on the scattering matrix, or S-matrix, 
in elementary particle physics. The first two papers were published in 1943^ ^ and the third in 1944.^ The 
S-matrix described only observables, i.e., the states of incident particles in a collision process, the states of those 
emerging from the collision, and stable bound states; there would be no reference to the intervening states. This was 
the same precedent as he followed in 1925 in what turned out to be the foundation of the matrix formulation of 
quantum mechanics through only the use of observables J ^ 

In February 1943, Heisenberg was appointed to the Chair for Theoretical Physics at the 
Friedrich-Wilhelms-Universitat (today, the Humboldt-Universitat zu Berlin). In April, his election to the Preufiische 
Akademie der Wissenschaften (Prussian Academy of Sciences) was approved. That same month, he moved his 
family to their retreat in Urfeld as Allied bombing increased in Berlin. In the summer, he dispatched the first of his 
staff at the Kaiser-Wilhelm Institut fiir Physik to Hechingen and its neighboring town of Haigerloch, on the edge of 
the Black Forest, for the same reasons. From 18-26 October, he traveled to German-occupied Netherlands. In 
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December 1943, Heisenberg visited German occupied Poland. 

From 24 January to 4 February 1944, Heisenberg traveled to occupied Copenhagen, after the German Army 

confiscated Bohr's Institute of Theoretical Physics. He made a short return trip in April. In December, Heisenberg 

ri7i 

lectured in neutral Switzerland. 

In January 1945, Heisenberg vacated the Kaiser-Wilhelm Institut fiir Physik with about all of his staff for the 

ri7i 

facilities in the Black Forest. 
Uranium Club 

In December 1938, the German chemists Otto Hahn and Fritz Strassmann sent a manuscript to Naturwissenschaften 
reporting they had detected the element barium after bombarding uranium with neutrons; t83] simultaneously, they 
communicated these results to Lise Meitner, who had in July of that year fled to the Netherlands and then went to 
Sweden. t84] Meitner, and her nephew Otto Robert Frisch, correctly interpreted these results as being nuclear 
fission J 85 ^ Frisch confirmed this experimentally on 13 January 1939.^ ^ 

Paul Harteck was director of the physical chemistry department at the University of Hamburg and an advisor to the 
Heereswaffenamt (HWA, Army Ordnance Office). On 24 April 1939, along with his teaching assistant Wilhelm 
Groth, Harteck made contact with the Reichskriegsministerium (RKM, Reich Ministry of War) to alert them to the 
potential of military applications of nuclear chain reactions. Two days earlier, on 22 April 1939, after hearing a 
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colloquium paper by Wilhelm Hanle on the use of uranium fission in a Uranmaschine (uranium machine, i.e., 
nuclear reactor), Georg Joos, along with Hanle, notified Wilhelm Dames, at the Reichserziehungsministerium (REM, 
Reich Ministry of Education), of potential military applications of nuclear energy. The communication was given to 
Abraham Esau, head of the physics section of the Reichsforschungsrat (RFR, Reich Research Council) at the REM. 
On 29 April, a group, organized by Esau, met at the REM to discuss the potential of a sustained nuclear chain 
reaction. The group included the physicists Walther Bothe, Robert Dopel, Hans Geiger, Wolfgang Gentner (probably 
sent by Walther Bothe), Wilhelm Hanle, Gerhard Hoffmann and Georg Joos; Peter Debye was invited, but he did not 
attend. After this, informal work began at the Georg-August University of Gottingen by Joos, Hanle and their 
colleague Reinhold Mannfopff; the group of physicists was known informally as the first Uranverein (Uranium 
Club) and formally as Arb e its g erne ins chaft fiir Kernphysik. The group's work was discontinued in August 1939, 
when the three were called to military training/ 88 ^ ^ ^ ^ 

The second Uranverein began after the Heereswaffenamt (HWA, Army Ordnance Office) squeezed the 
Reichsforschungsrat (RFR, Reich Research Council) out of the Reichserziehungsministerium (REM, Reich Ministry 
of Education) and started the formal German nuclear energy project under military auspices. The second Uranverein 
was formed on 1 September 1939, the day World War II began, and it had its first meeting on 16 September 1939. 
The meeting was organized by Kurt Diebner, advisor to the HWA, and held in Berlin. The invitees included Walther 
Bothe, Siegfried Fliigge, Hans Geiger, Otto Hahn, Paul Harteck, Gerhard Hoffmann, Josef Mattauch and Georg 
Stetter. A second meeting was held soon thereafter and included Klaus Clusius, Robert Dopel, Werner Heisenberg 
and Carl Friedrich von Weizsacker. Also at this time, the Kaiser -Wilhelm InstitutfUr Physik (KWIP, Kaiser Wilhelm 
Institute for Physics, after World War II the Max Planck Institute for Physics), in Berlin-Dahlem, was placed under 
HWA authority, with Diebner as the administrative director, and the military control of the nuclear research 
commenced [90][91][92] 

When it was apparent that the nuclear energy project would not make a decisive contribution to ending the war effort 
in the near term, control of the KWIP was returned in January 1942 to its umbrella organization, the Kaiser -Wilhelm 
Gesellschaft (KWG, Kaiser Wilhelm Society, after World War II the Max-Planck Gesellschaft), and HWA control of 
the project was relinquished to the RFR in July 1942. The nuclear energy project thereafter maintained its 
kriegswichtig (important for the war) designation and funding continued from the military. However, the German 
nuclear power project was then broken down into the following main areas: uranium and heavy water production, 
uranium isotope separation and the Uranmaschine (uranium machine, i.e., nuclear reactor). Also, the project was 
then essentially split up between a number of institutes, where the directors dominated the research and set their own 
research agendas P 0 ^ ^ ^ The dominant personnel and facilities were the following m P 5 ^ ^ ^ 

• InstitutfUr Physik (Walther Bothe) of the Kaiser-Wilhelm InstitutfUr medizinische Forschung (KWImF, Kaiser 
Wilhelm Institute for Medical Research), 

Institute for Physical Chemistry (Klaus Clusius) at the Ludwig Maximilian University of Munich, 
HWA Versuchsstelle (testing station) in Gottow (Kurt Diebner), 
Kaiser-Wilhelm-Institut fUr Chemie (Otto Hahn), 

Physical Chemistry Department (Paul Harteck) of the University of Hamburg, 
Kaiser-Wilhelm-Institut fUr Physik (Werner Heisenberg), 

Second Experimental Physics Institute (Hans Kopfermann) at the Georg-August University of Gottingen, 
Auergesellschaft (Nikolaus Riehl), and 

//. Physikalisches Institut (Georg Stetter) at the University of Vienna. 

Heisenberg was appointed director-in-residence of the KWIP on 1 July 1942, as Peter Debye was still officially the 
director and on leave in the United States; Debye had gone on leave as he was a citizen of The Netherlands and had 
refused to become a German citizen when the HWA took administrative control of the KWIP. Heisenberg still also 
had his department of physics at the University of Leipzig where work was done for the Uranverein by Robert Dopel 
and his wife Klara Dopel. During the period Kurt Diebner administered the KWIP under the HWA program, 
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considerable personal and professional animosity developed between Diebner and the Heisenberg inner circle - 
Heisenberg, Karl Wirtz, and Carl Friedrich von WeizsackerJ 17 ^ ^ 

The point in 1942, when the army relinquished its control of the German nuclear energy project, was the zenith of 
the project relative to the number of personnel devoting time to the effort. There were only about 70 scientists 
working on the project, with about 40 devoting more than half their time to nuclear fission research. After this, the 
number of scientists working on applied nuclear fission diminished dramatically. Many of the scientists not working 
with the main institutes stopped working on nuclear fission and devoted their efforts to more pressing war related 
work. 1 "" 1 

Over time, the HWA and then the RFR controlled the German nuclear energy project. The most influential people in 
the project were Kurt Diebner, Abraham Esau, Walther Gerlach and Erich Schumann. Schumann was one of the 
most powerful and influential physicists in Germany. Schumann was director of the Physics Department II at the 
Frederick William University (later, University of Berlin), which was commissioned and funded by the 
Oberkommando des He ere s (OKW, Army High Command) to conduct physics research projects. He was also head 
of the research department of the HWA, assistant secretary of the Science Department of the OKW and 
Bevollmachtiger (plenipotentiary) for high explosives. Diebner, throughout the life of the nuclear energy project, had 
more control over nuclear fission research than did Walther Bothe, Klaus Clusius, Otto Hahn, Paul Harteck or 
Werner Heisenberg J 100 ^ ^ 101 ^ 

1945: Operation Alsos and Operation Epsilon 

Operation Alsos was an Allied effort commanded by the Russian-American Colonel Boris T. Pash. He reported 
directly to General Leslie Groves, commander of the Manhattan Engineer District, which was developing atomic 
weapons for the United States. The chief scientific advisor to Operation Alsos was the physicist Samuel Abraham 
Goudsmit. Goudsmit was selected for this task because of his knowledge of physics, he spoke German, and he 
personally knew a number of the German scientists working on the German nuclear energy project. He also knew 
little of the Manhattan Project, so, if he were captured, he would have little intelligence value to the Germans. The 
objectives of Operation Alsos were to determine if the Germans had an atomic bomb program and to exploit German 
atomic related facilities, intellectual materials, materiel resources, and scientific personnel for the benefit of the 
United States. Personnel on this operation generally swept into areas which had just come under control of the Allied 
military forces, but sometimes they operated in areas still under control by German forces J 102 ^ ^ 103 ^ '- 104 -' 

Berlin had been a location of many German scientific research facilities. To limit casualties and loss of equipment, 

many of these facilities were dispersed to other locations in the latter years of the war. The Kaiser-Wilhelm-Institut 

fiir Physik (KWIP, Kaiser Wilhelm Institute for Physics) had mostly been moved in 1943 and 1944 to Hechingen and 

its neighboring town of Haigerloch, on the edge of the Black Forest, which eventually became the French occupation 

zone. This move and a little luck allowed the Americans to take into custody a large number of German scientists 

associated with nuclear research. The only section of the institute which remained in Berlin was the low-temperature 

physics section, headed by Ludwig Bewilogua (1906-83), who was in charge of the exponential uranium pileJ 105 ^ 
[106] 

Nine of the prominent German scientists who published reports in Kernphysikalische Forschungsberichte as 
members of the Uranverein^ 101 ^ were picked up by Operation Alsos and incarcerated in England under Operation 
Epsilon: Erich Bagge, Kurt Diebner, Walther Gerlach, Otto Hahn, Paul Harteck, Werner Heisenberg, Horst 
Korsching, Carl Friedrich von Weizsacker and Karl Wirtz. Also, incarcerated was Max von Laue, although he had 
nothing to do with the nuclear energy project. Goudsmit, the chief scientific advisor to Operation Alsos, thought von 
Laue might be beneficial to the postwar rebuilding of Germany and would benefit from the high level contacts he 
would have in England. tl08] 

Heisenberg had been captured and arrested by Colonel Pash at Heisenberg's retreat in Urfeld, on 3 May 1945, in 
what was a true alpine-type operation in territory still under control by German forces. He was taken to Heidelberg, 
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where, on 5 May, he met Goudsmit for the first time since the Ann Arbor visit in 1939. Germany surrendered just 
two days later. Heisenberg would not see his family again for eight months. Heisenberg was moved across France 
and Belgium and flown to England on 3 July 1945. [109] [110] 

The 10 German scientists were held at Farm Hall in England. The facility had been a safe house of the British 
foreign intelligence MI6. During their detention, their conversations were recorded. Conversation thought to be of 
intelligence value were transcribed and translated into English. The transcripts were released in 1992. Bernstein has 
published an annotated version of the transcripts in his book Hitler's Uranium Club: The Secret Recordings at Farm 
Hall, along with an introduction to put them in perspective. A complete, unedited publication of the British version 
of the reports appeared as Operation Epsilon: The Farm Hall Transcripts, which was published in 1993 by the 
Institute of Physics in Bristol and by the University of California Press in the United States/ 112 ^ ^ 113 ^ ^ 114 ^ 

Post 1945 

On 3 January 1946, the 10 Operation Epsilon detainees were transported to Alswede, Germany, which was in the 
British occupation zone. Heisenberg settled in Gottingen, also in the British zone. In July, he was named director of 
the Kaiser-Wilhelm Institut fiir Physik (KWIP, Kaiser Wilhelm Institute for Physics), then located in Gottingen. 
Shortly thereafter, it was renamed the Max-Planck Institut fiir Physik, in honor of Max Planck and to assuage 
political objections to the continuation of the institute. Heisenberg was its director until 1958. In 1958, the institute 
was moved to Munich, expanded, and renamed Max-Planck-Institut fiir Physik und Astrophysik (MPIFA). 
Heisenberg was its director from 1960 to 1970; in the interim, Heisenberg and the astrophysicist Ludwig Biermann 
were co-directors. Heisenberg resigned his directorship of the MPIFA on 31 December 1970. Upon the move to 
Munich, Heisenberg also became an ordentlicher Professor (ordinarius professor) at the University of Munich. ^ tl7] 

Just as the Americans did with Operation Alsos, the Russians inserted special search teams into Germany and 
Austria in the wake of their troops. Their objective, under the Russian Alsos, was also the exploitation of German 
atomic related facilities, intellectual materials, materiel resources and scientific personnel for the benefit of the 
Soviet Union. One of the German scientists recruited under this Russian operation was the nuclear physicist Heinz 
Pose, who was made head of Laboratory V in Obninsk. When he returned to Germany on a recruiting trip for his 
laboratory, Pose wrote a letter to the Werner Heisenberg inviting him to work in Russia. The letter lauded the 
working conditions in Russia and the available resources, as well as the favorable attitude of the Russians towards 
German scientists. A courier hand delivered the recruitment letter, dated 18 July 1946, to Heisenberg; Heisenberg 
politely declined in a return letter to PoseJ 115] tll6] 

In 1947, Heisenberg presented lectures in Cambridge, Edinburgh and Bristol. Heisenberg also contributed to the 
understanding of the phenomenon of superconductivity with a paper in 1947^ 17 ^ and two papers in 1948^ 118 ^ ^ 119 ^ 
one of them with Max von LaueJ 17 ^ ^ 120 ^ 

In the period shortly after World War II, Heisenberg briefly returned to the subject of his doctoral thesis, turbulence. 
Three papers were published in 1948^ 121 ^ ^ 122 ^ ^ 123 ^ and one in 1950. ^ ^ 124 ^ 

In the post-war period, Heisenberg continued his interests in cosmic-ray showers with considerations on multiple 
production of mesons. He published three papers [125] [126] [127] in 1949, two [128] [129] in 1952, and one [130] in 
1955. [131] 

On 9 March 1949, the Deutsche Forschungsrat (German Research Council) was established by the Max-Planck 
Gesellschaft (MPG, Max Planck Society, successor organization to the Kaiser-Wilhelm Gesellschaft. Heisenberg was 
appointed president of the Deutsche Forschungsrat. In 1951, the organization was fused with the Notgemeinschaft 
der Deutschen Wissenschaft (NG, Emergency Association of German Science) and that same year renamed the 
Deutsche Forschungsgemeinschaft (DFG, German Research Foundation). With the merger, Heisenberg was 
appointed to the presidium. tl7] tl32] tl33] 

In 1952, Heisenberg served as the chairman of the Commission for Atomic Physics of the DFG. Also that year, he 
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headed the German delegation to the European Council for Nuclear Research. 
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In 1953, Heisenberg was appointed president of the Alexander von Humboldt-Stiftung by Konrad Adenauer. 
Heisenberg served until 1975. Also, from 1953, Heisenberg's theoretical work concentrated on the unified field 
theory of elementary particles ^ ^ 

In late 1955 to early 1956, Heisenberg gave the Gifford Lectures at St Andrews University, in Scotland, on the 
intellectual history of physics. The lectures were later published as Physics and Philosophy: The Revolution in 
Modem Science J 134 -' 

During 1956 and 1957, Heisenberg was the chairman of the Arbeitskreis Kernphysik (Nuclear Physics Working 
Group) of the Fachkommission II "Forschung und Nachwuchs" (Commission II "Research and Growth") of the 
Deutschen Atomkommission (DAtK, German Atomic Energy Commission). Other members of the Nuclear Physics 
Working Group in both 1956 and 1957 were: Walther Bothe, Hans Kopfermann (vice-chairman), Fritz Bopp, 
Wolfgang Gentner, Otto Haxel, Willibald Jentschke, Heinz Maier-Liebnitz, Josef Mattauch, Wolfgang Riezler, 
Wilhelm Walcher and Carl Friedrich von Weizsacker. Wolfgang Paul was also a member of the group during 
1957. [135] 

In 1957, Heisenberg was a signatory of the manifesto of the Gottinger Achtzehn (Gottingen Eighteen)/ 136 ^ 

From 1957, Heisenberg was interested in plasma physics and the process of nuclear fusion. He also collaborated with 
the International Institute of Atomic Physics in Geneva. He was a member of the Institute's Scientific Policy 
Committee, and for several years was the Committee's chairman. 

In 1973, Heisenberg gave a lecture at Harvard University on the historical development of the concepts of quantum 
theory. 1 '- ,71 

On 24 March 1973, Heisenberg gave a speech before the Catholic Academy of Bavaria, accepting the Romano 

Guardini Prize. An English translation of its title is "Scientific and Religious Truth." And its stated goal was "In what 

follows, then, we shall first of all deal with the unassailability and value of scientific truth, and then with the much 

wider field of religion, of which - so far as the Christian religion is concerned - Guardini himself has so persuasively 

written; finally - and this will be the hardest part to formulate - we shall speak of the relationship of the two 
ri38i 

truths." A more detail insight in Planck and Heisenberg on religion has been discussed by Wilfried Schroder in " 
Natural science and religion" (Bremen 1999, Science edition) and Wilfried Schroder " Naturerkenntnis und 
Religion" Bremen, science edition 2008). 

Personal life 

In January 1937 Heisenberg met Elisabeth Schumacher at a private music recital. Elisabeth was the daughter of a 
well-known Berlin economics professor. They were married on 29 April. The fraternal twins, Maria and Wolfgang, 
were born to them in January 1938, whereupon, Wolfgang Pauli congratulated Heisenberg on his "pair creation" - a 
word play on a process from elementary particle physics, pair production. They had five more children over the next 
12 years: Barbara, Christine, Jochen, Martin and Verena. Jochen became a physics professor at the University of 
New Hampshire/ 1391 [140] 

Heisenberg enjoyed classical music and was an accomplished pianist. ^ He was a Lutheran. ^ 141 ] 

Heisenberg died of cancer of the kidneys and gall bladder at his home, on 1 February 1976^ 142 ^ The next evening, 
his colleagues and friends walked in remembrance from the Institute of Physics to his home and each put a candle 
near the front doorJ 143 ^ 
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Honors and awards 

Heisenberg was awarded a number of honors: 

• Honorary doctorates from the University of Bruxelles, the Technological University of Karlsruhe, and the 
University of Budapest. 

• Order of Merit of Bavaria 

• Romano Guardini Prize^ 138 ^ 

• Grand Cross for Federal Service with Star 

• Knight of the Order of Merit (Peace Class) 

• Fellow of the Royal Society of London 

• Member of the Academies of Sciences of Gottingen, Bavaria, Saxony, Prussia, Sweden, Rumania, Norway, 
Spain, The Netherlands, Rome (Pontifical), the Deutsche Akademie der Naturforscher Leopoldina (Halle), the 
Accademia dei Lincei (Rome), and the American Academy of Sciences. 

• 1932-Nobel Prize in Physics "for the creation of quantum mechanics, the application of which has, inter alia, led 
to the discovery of the allotropic forms of hydrogen "J 61 ^ 

• 1933- Max-Planck-Medaille of the Deutsche Physikalische Gesellschaft 



Internal reports 

The following reports were published in Kernphysikalische Forschungsberichte {Research Reports in Nuclear 
Physics), an internal publication of the German Uranverein. The reports were classified Top Secret, they had very 
limited distribution, and the authors were not allowed to keep copies. The reports were confiscated under the Allied 
Operation Alsos and sent to the United States Atomic Energy Commission for evaluation. In 1971, the reports were 
declassified and returned to Germany. The reports are available at the Karlsruhe Nuclear Research Center and the 
American Institute of Physics P 44 ^ ^ 145 ^ 

• Robert Dopel, K. Dopel, and Werner Heisenberg Bestimmung der Diffusionslange thermischer Neutronen in 
Praparat 38 [U6] G-22 (5 December 1940) 

• Robert Dopel, K. Dopel, and Werner Heisenberg Bestimmung der Diffusionslange thermischer Neutronen in 
schwerem Wasser G-23 (7 August 1940) 

• Werner Heisenberg Die Moglichkeit der technischer Energiegewinnung aus der Uranspaltung G-39 (6 December 
1939) 

• Werner Heisenberg Bericht uber die Moglichkeit technischer Energiegewinnung aus der Uranspaltung (II) G-40 
(29 February 1940) 

• Robert Dopel, K. Dopel, and Werner Heisenberg Versuche mit Schichtenanordnungen von D^D und 38 G-75 (28 
October 1941) 

• Werner Heisenberg Uber die Moglichkeit der Energieerzeugung mit Hilfe des Isotops 238 G-92 (1941) 

• Werner Heisenberg Bericht uber Versuche mit Schichtenanordnungen von Praparat 38 und Paraffin am Kaiser 
Wilhelm Institut fur Physik in Berlin-Dahlem G-93 (May 1941) 

• Fritz Bopp, Erich Fischer, Werner Heisenberg, Carl-Friedrich von Weizsacker, and Karl Wirtz Untersuchungen 
mit neuen Schichtenanordnungen aus U-metall und Paraffin G-127 (March 1942) 

• Robert Dopel Bericht uber Unfalle beim Umgang mit Uranmetall G-135 (9 July 1942) 

• Werner Heisenberg Bemerkungen zu dem geplanten halbtechnischen Versuch mit 1,5 to D^O und 3 to 38-Metall 
G-161 (31 July 1942) 

• Werner Heisenberg, Fritz Bopp, Erich Fischer, Carl-Friedrich von Weizsacker, and Karl Wirtz Messungen an 
Schichtenanordnungen aus 38-Metall und Paraffin G-162 (30 October 1942) 

• Robert Dopel, K. Dopel, and Werner Heisenberg Der experimentelle Nachweis der effektiven 
Neutronenvermehrung in einem Kugel-Schichten-System aus D^O und Uran-Metall G-136 (July 1942) 

• Werner Heisenberg Die Energiegewinnung aus der Atomkernspaltung G-217 (6 May 1943) 
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• Fritz Bopp, Walther Bothe, Erich Fischer, Erwin Fiinfer, Werner Heisenberg, O. Ritter, and Karl Wirtz Bericht 
Uber einen Versuch mit 1.5 to D£) und U und 40 cm Kohlerilckstreumantel (B7) G-300 (3 January 1945) 
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Unified Field Theory 
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Albert Einstein (pronounced /'selb9rt 'ainstain/; German: ['albBt 'ainjtain] ( 4)) listen); 14 March 1879- 18 April 
1955) was a theoretical physicist, philosopher and author who is widely regarded as one of the most influential and 
iconic scientists and intellectuals of all time. A German-Swiss Nobel laureate, Einstein is often regarded as the father 
of modern physics. He received the 1921 Nobel Prize in Physics "for his services to theoretical physics, and 
especially for his discovery of the law of the photoelectric effect". 

Near the beginning of his career, Einstein thought that Newtonian mechanics was no longer enough to reconcile the 

laws of classical mechanics with the laws of the electromagnetic field. This led to the development of his special 

theory of relativity. He realized, however, that the principle of relativity could also be extended to gravitational 

fields, and with his subsequent theory of gravitation in 1916, he published a paper on the general theory of relativity. 

He continued to deal with problems of statistical mechanics and quantum theory, which led to his explanations of 

particle theory and the motion of molecules. He also investigated the thermal properties of light which laid the 

foundation of the photon theory of light. In 1917, Einstein applied the general theory of relativity to model the 

Mi 

structure of the universe as a whole. 

On the eve of World War II in 1939, he personally alerted President Franklin D. Roosevelt that Germany might be 
developing an atomic weapon, and recommended that the U.S. begin uranium procurement and nuclear research. As 
a result, Roosevelt advocated such research, leading to the creation of the top secret Manhattan Project, and the U.S. 
becoming the first and only country to possess nuclear weapons during the war. Days before his death, however, 
Einstein signed the Russell-Einstein Manifesto, that highlighted the dangers posed by the military usage of nuclear 
energy. 

Einstein published more than 300 scientific along with over 150 non-scientific works, and received honorary 

Mi 

doctorate degrees in science, medicine and philosophy from many European and American universities; he also 
wrote about various philosophical and political subjects such as socialism, international relations and the existence of 
God.^ His great intelligence and originality has made the word "Einstein" synonymous with genius ^ 

Biography 

Early life and education 
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Albert Einstein was born in Ulm, in the Kingdom of Wurttemberg in the German 
Empire on 14 March 1879. His father was Hermann Einstein, a salesman and 
engineer. His mother was Pauline Einstein (nee Koch). In 1880, the family 
moved to Munich, where his father and his uncle founded Elektrotechnische 
Fabrik J. Einstein & Cie, a company that manufactured electrical equipment 



based on direct current 



[V] 



The Einsteins were non-observant Jews. Their son attended a Catholic 
elementary school from the age of five until ten. 1 Although Einstein had early 
speech difficulties, he was a top student in elementary school ^ 

His father once showed him a pocket compass; Einstein realized that there must 
be something causing the needle to move, despite the apparent "empty space" P ^ 
As he grew, Einstein built models and mechanical devices for fun and began to 
show a talent for mathematics. In 1889, Max Talmud (later changed to Max 
Talmey) introduced the ten-year old Einstein to key texts in science, mathematics 
and philosophy, including Immanuel Kant's Critique of Pure Reason and Euclid's 
Elements (which Einstein called the "holy little geometry book"). Talmud was 
a poor Jewish medical student from Poland. The Jewish community arranged for 
Talmud to take meals with the Einsteins each week on Thursdays for six years. 
During this time Talmud wholeheartedly guided Einstein through many secular 
educational interests. 11 3] tl4] 




Albert Einstein in 1893 (age 14). 



In 1894, his father's company failed: direct current (DC) lost the War of Currents to alternating current (AC). In 
search of business, the Einstein family moved to Italy, first to Milan and then, a few months later, to Pavia. When the 
family moved to Pavia, Einstein stayed in Munich to finish his studies at the Luitpold Gymnasium. His father 
intended for him to pursue electrical engineering, but Einstein clashed with authorities and resented the school's 
regimen and teaching method. He later wrote that the spirit of learning and creative thought were lost in strict rote 
learning. In the spring of 1895, he withdrew to join his family in Pavia, convincing the school to let him go by using 
a doctor's note. 1 During this time, Einstein wrote his first scientific work, "The Investigation of the State of Aether 
in Magnetic Fields" J 15 ^ 

Einstein applied directly to the Eidgenossische Polytechnische Schule (ETH) in Zurich, Switzerland. Lacking the 

requisite Matura certificate, he took an entrance examination, which he failed, although he got exceptional marks in 

mathematics and physics J 16 ^ The Einsteins sent Albert to Aarau, in northern Switzerland to finish secondary 
T71 

school. While lodging with the family of Professor Jost Winteler, he fell in love with the family's daughter, Marie. 
(His sister Maja later married the Wintelers' son Paul.) In Aarau, Einstein studied Maxwell's electromagnetic 
theory. At age 17, he graduated, and, with his father's approval, renounced his citizenship in the German Kingdom of 
Wurttemberg to avoid military service, and in 1896 he enrolled in the four year mathematics and physics teaching 
diploma program at the Polytechnic in Zurich. Marie Winteler moved to Olsberg, Switzerland for a teaching post. 
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Einstein's future wife, Mileva Marie, also enrolled at the Polytechnic that same year, the only woman among the six 
students in the mathematics and physics section of the teaching diploma course. Over the next few years, Einstein 
and Marie's friendship developed into romance, and they read books together on extra-curricular physics in which 
Einstein was taking an increasing interest. In 1900 Einstein was awarded the Zurich Polytechnic teaching diploma, 

MOT 

but Marie failed the examination with a poor grade in the mathematics component, theory of functions. There 

have been claims that Marie collaborated with Einstein on his celebrated 1905 papers, ^ ^ but historians of 

r211 T221 T231 T241 

physics who have studied the issue find no evidence that she made any substantive contributions. 

Marriages and children 

In early 1902, Einstein and Mileva Marie had a daughter they named Lieserl in their correspondence, who was born 
in Novi Sad where Marie's parents lived. Her full name is not known, and her fate is uncertain after 1903. 

Einstein and Marie married in January 1903. In May 1904, the couple's first son, Hans Albert Einstein, was born in 
Bern, Switzerland. Their second son, Eduard, was born in Zurich in July 1910. In 1914, Einstein moved to Berlin, 
while his wife remained in Zurich with their sons. Marie and Einstein divorced on 14 February 1919, having lived 
apart for five years. 

Einstein married Elsa Lowenthal (nee Einstein) on 2 June 1919, after having had a relationship with her since 1912. 

She was his first cousin maternally and his second cousin paternally. In 1933, they emigrated permanently to the 

[27] 

United States. In 1935, Elsa Einstein was diagnosed with heart and kidney problems and died in December 1936. 



Patent office 




Left to right: Conrad Habicht, Maurice Solovine 
and Einstein, who founded the Olympia Academy 




After graduating, Einstein spent almost two frustrating years searching 
for a teaching post, but a former classmate's father helped him secure a 
job in Bern, at the Federal Office for Intellectual Property, the patent 
office, as an assistant examiner. 1 He evaluated patent applications 
for electromagnetic devices. In 1903, Einstein's position at the Swiss 
Patent Office became permanent, although he was passed over for 
promotion until he "fully mastered machine technology" P 9 ^ 

Much of his work at the patent office related to questions about 
transmission of electric signals and electrical-mechanical 
synchronization of time, two technical problems that show up 
conspicuously in the thought experiments that eventually led Einstein 
to his radical conclusions about the nature of light and the fundamental 
connection between space and timeJ 30 ^ 

With a few friends he met in Bern, Einstein started a small discussion 
group, self-mockingly named "The Olympia Academy", which met 
regularly to discuss science and philosophy. Their readings included 
the works of Henri Poincare, Ernst Mach, and David Hume, which 
influenced his scientific and philosophical outlook. 



Einstein's home in Bern 
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Academic career 

In 1901, Einstein had a paper on the capillary forces of a straw 
published in the prestigious Annalen der Physik. On 30 April 1905, 
he completed his thesis, with Alfred Kleiner, Professor of Experimental 
Physics, serving as pro-forma advisor. Einstein was awarded a PhD by 
the University of Zurich. His dissertation was entitled "A New 
Determination of Molecular Dimensions". That same year, which has 
been called Einstein's annus mirabilis or "miracle year", he published 
four groundbreaking papers, on the photoelectric effect, Brownian 
motion, special relativity, and the equivalence of matter and energy, 
which were to bring him to the notice of the academic world. 

By 1908, he was recognized as a leading scientist, and he was appointed 
lecturer at the University of Berne. The following year, he quit the 
patent office and the lectureship to take the position of physics 
docent at the University of Zurich. He became a full professor at 
Karl-Ferdinand University in Prague in 1911. In 1914, he returned to 
Germany after being appointed director of the Kaiser Wilhelm Institute 
for Physics (1914-1932) and a professor at the Humboldt University 
of Berlin, although with a special clause in his contract that freed him 
from most teaching obligations. He became a member of the Prussian Academy of Sciences. In 1916, Einstein was 
appointed president of the German Physical Society (1916-1918). [35] [36] 

In 1911, he had calculated that, based on his new theory of general relativity, light from another star would be bent 
by the Sun's gravity. That prediction was claimed confirmed by observations made by a British expedition led by Sir 
Arthur Eddington during the solar eclipse of May 29, 1919. International media reports of this made Einstein world 
famous. On 7 November 1919, the leading British newspaper The Times printed a banner headline that read: 
"Revolution in Science - New Theory of the Universe - Newtonian Ideas Overthrown". (Much later, questions 
were raised whether the measurements were accurate enough to support Einstein's theory.) 

In 1921, Einstein was awarded the Nobel Prize in Physics. Because relativity was still considered somewhat 
controversial, it was officially bestowed for his explanation of the photoelectric effect. He also received the Copley 
Medal from the Royal Society in 1925. 




Einstein's official 1921 portrait after receiving 
the Nobel Prize in Physics. 



Travels abroad 

Einstein visited New York City for the first time on 2 April 1921. When asked where he got his scientific ideas, 
Einstein explained that he believed scientific work best proceeds from an examination of physical reality and a 
search for underlying axioms, with consistent explanations that apply in all instances and avoid contradicting each 
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other. He also recommended theories with visualizable results. (Einstein 1954) 

In 1922, he traveled throughout Asia and later to Palestine, as part of a six-month excursion and speaking tour. His 

travels included Singapore, Ceylon, and Japan, where he gave a series of lectures to thousands of Japanese. His first 

lecture in Tokyo lasted four hours, after which he met the emperor and empress at the Imperial Palace where 
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thousands came to watch. Einstein later gave his impressions of the Japanese in a letter to his sons: ' "Of all the 

T391 

people I have met, I like the Japanese most, as they are modest, intelligent, considerate, and have a feel for art.' 

:308 

On his return voyage, he also visited Palestine for twelve days in what would become his only visit to that region. 
"He was greeted with great British pomp, as if he were a head of state rather than a theoretical physicist", writes 
Isaacson. This included a cannon salute upon his arrival at the residence of the British high commissioner, Sir 
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Herbert Samuel. During one reception given to him, the building was "stormed by throngs who wanted to hear him". 
In Einstein's talk to the audience, he expressed his happiness over the event: 

I consider this the greatest day of my life. Before, I have always found something to regret in the Jewish 
soul, and that is the forgetfulness of its own people. Today, I have been made happy by the sight of the 
Jewish people learning to recognize themselves and to make themselves recognized as a force in the 
world. [40]:308 



Emigration from Germany 

In 1933, Einstein was compelled to immigrate to the United States due 
to the rise to power of the Nazis under Germany's new chancellor, 
Adolf HitlerJ 41 ^ While visiting American universities in April, 1933, 
he learned that the new German government had passed a law barring 
Jews from holding any official positions, including teaching at 
universities. A month later, the Nazi book burnings occurred, with 
Einstein's works being among those burnt, and Nazi propaganda 
minister Joseph Goebbels proclaimed, "Jewish intellectualism is 
dead." t40] Einstein also learned that his name was on a list of 
assassination targets, with a "$5,000 bounty on his head". One German 
magazine included him in a list of enemies of the German regime with 
the phrase, "not yet hanged" J 40 ^ ^ 




Next to Oliver Locker-Lampson, being protected 
in Norfolk, England, after escaping Nazi 
Germany in 1933 



Among other German scientists forced to flee were fourteen Nobel 
laureates and twenty-six of the sixty professors of theoretical physics 
in the country. Among the other scientists who left Germany, or the 

other countries it came to dominate, were Edward Teller, Niels Bohr, Enrico Fermi, Otto Stern, Victor Weisskopf, 
Hans Bethe, and Lise Meitner, many of whom made certain that the Allies would develop nuclear weapons first, 
before the Nazis J 40 ^ With so many other Jewish scientists now forced by circumstances to live in America, often 
working side by side, Einstein wrote to a friend, "For me the most beautiful thing is to be in contact with a few fine 
Jews — a few millennia of a civilized past do mean something after all." In another letter he writes, "In my whole life 
I have never felt so Jewish as now."'" 40 -' 
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He took up a position at the Institute for Advanced Study at Princeton, New Jersey, an affiliation that lasted until 
his death in 1955. There, he tried unsuccessfully to develop a unified field theory and to refute the accepted 
interpretation of quantum physics. He and Kurt Godel, another Institute member, became close friends. They would 
take long walks together discussing their work. His last assistant was Bruria Kaufman, who later became a renowned 
physicist. 

World War II and the Manhattan Project 

In the summer of 1939, a few months before the beginning of World War II, Einstein was persuaded to write a letter 
to President Franklin D. Roosevelt and warn him that Nazi Germany might be developing an atomic bomb. The 
letter, written with the help of Hungarian emigre physicist Leo Szilard, gave the letter more prestige, with Einstein 
also recommending that the U.S. begin uranium enrichment and nuclear research. According to F.G. Gosling of the 
U.S. Department of Energy, Einstein, Szilard, and other refugees including Edward Teller and Eugene Wigner, 
"regarded it as their responsibility to alert Americans to the possibility that German scientists might win the race to 
build an atomic bomb, and to warn that Hitler would be more than willing to resort to such a weapon. " t44] 

British columnist Ambrose Evans-Pritchard notes, however, that Washington at first "brushed off with disbelief" the 
fears they expressed. He then describes how quickly Roosevelt changed his mind: 
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Albert Einstein interceded through the Belgian queen mother, eventually getting a personal envoy into 
the Oval Office. Roosevelt initially fobbed him off. He listened more closely at a second meeting over 
breakfast the next day, then made up his mind within minutes. "This needs action," he told his military 
aide. It was the birth of the Manhattan Project.'- 45 -' 

Gosling adds that "the President was a man of considerable action once he had chosen a direction," and believed that 
the U.S. "could not take the risk of allowing Hitler" to possess nuclear bombs. ^ Other weapons historians agree 
that the letter was "arguably the key stimulus for the U.S. adoption of serious investigations into nuclear weapons on 
the eve of the U.S. entry into World War II". As a result of Einstein's letter, and his meetings with Roosevelt, the 
U.S. entered the "race" to develop the bomb first, drawing on its "immense material, financial, and scientific 
resources". It became the only country to develop an atomic bomb during World War II as a result of its Manhattan 
Project J 46 ^ Einstein said to his old friend, Linus Pauling, in 1954, the last year of his life: "I made one great mistake 
in my life — when I signed the letter to President Roosevelt recommending that atom bombs be made; but there was 
some justification — the danger that the Germans would make them..."^ 47 ' 

U.S. citizenship 

Einstein became an American citizen in 1940. Not long after settling 
into his career at Princeton, he expressed his appreciation of the 
"meritocracy" in American culture when compared to Europe. 
According to Isaacson, he recognized the "right of individuals to say 
and think what they pleased", without social barriers, and as result, the 
individual was "encouraged" to be more creative, a trait he valued from 
his own early education. Einstein writes: 

What makes the new arrival devoted to this country is the 
democratic trait among the people. No one humbles himself 
before another person or class. . . American youth has the good 

fortune not to have its outlook troubled by outworn traditions. ^ 

:432 




As a member of the NAACP at Princeton who campaigned for the civil 
rights of African Americans, Einstein corresponded with civil rights 
activist W. E. B. Du Bois, and in 1946 Einstein called racism 
America's "worst disease" J 48] He later stated, "Race prejudice has 
unfortunately become an American tradition which is uncritically 
handed down from one generation to the next. The only remedies are 
enlightenment and education" J 49 ^ 

After the death of Israel's first president, Chaim Weizmann, in 

November 1952, Prime Minister David Ben-Gurion offered Einstein 

the position of President of Israel, a mostly ceremonial postJ 50 ^ The 

offer was presented by Israel's ambassador in Washington, Abba Eban, 

who explained that the offer "embodies the deepest respect which the 
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Jewish people can repose in any of its sons". ' However, Einstein 
declined, and wrote in his response that he was "deeply moved", and 
"at once saddened and ashamed" that he could not accept it: 




Einstein with David Ben Gurion, 1951 
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All my life I have dealt with objective matters, hence I lack both the natural aptitude and the experience to 
deal properly with people and to exercise official function. I am the more more distressed over these 
circumstances because my relationship with the Jewish people became my strongest human tie once I achieved 
complete clarity about our precarious position among the nations of the worldP 9 ^ 522 ^ '- 51 -' 



Death 

On April 17, 1955, Albert Einstein experienced internal bleeding 
caused by the rupture of an abdominal aortic aneurysm, which had 
previously been reinforced surgically by Dr. Rudolph Nissen in 
1948. He took the draft of a speech he was preparing for a 
television appearance commemorating the State of Israel's seventh 
anniversary with him to the hospital, but he did not live long enough to 
complete it. Einstein refused surgery, saying: "I want to go when I 
want. It is tasteless to prolong life artificially. I have done my share, it 
is time to go. I will do it elegantly.' He died in Princeton Hospital 
early the next morning at the age of 76, having continued to work until 
near the end. 
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The New York World-Telegram announces 
Einstein's death on April 18, 1955. 



Einstein's remains were cremated and his ashes were scattered around the grounds of the Institute for Advanced 
Study P 5 ^ t56] During the autopsy, the pathologist of Princeton Hospital, Thomas Stoltz Harvey, removed Einstein's 
brain for preservation, without the permission of his family, in hope that the neuroscience of the future would be able 
to discover what made Einstein so intelligent. 1 



Scientific career 

Throughout his life, Einstein published hundreds of books and articles. 
Most were about physics, but a few expressed leftist political opinions 
about pacifism, socialism, and zionism.^ ^ In addition to the work he 
did by himself he also collaborated with other scientists on additional 
projects including the Bose-Einstein statistics, the Einstein refrigerator 
and others. [58] 

Physics in 1900 

Einstein's early papers all come from attempts to demonstrate that 
atoms exist and have a finite nonzero size. At the time of his first paper 
in 1902, it was not yet completely accepted by physicists that atoms 
were real, even though chemists had good evidence ever since Antoine 
Lavoisier's work a century earlier. The reason physicists were skeptical 
was because no 19th century theory could fully explain the properties 
of matter from the properties of atoms. 




Albert Einstein in 1904. 



Ludwig Boltzmann was a leading 19th century atomist physicist, who 

had struggled for years to gain acceptance for atoms. Boltzmann had given an interpretation of the laws of 
thermodynamics, suggesting that the law of entropy increase is statistical. In Boltzmann's way of thinking, the 
entropy is the logarithm of the number of ways a system could be configured inside. The reason the entropy goes up 

is only because it is more likely for a system to go from a special state with only a few possible internal 
configurations to a more generic state with many. While Boltzmann's statistical interpretation of entropy is 
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universally accepted today, and Einstein believed it, at the turn of the 20th century it was a minority position. 

The statistical idea was most successful in explaining the properties of gases. James Clerk Maxwell, another leading 
atomist, had found the distribution of velocities of atoms in a gas, and derived the surprising result that the viscosity 
of a gas should be independent of density. Intuitively, the friction in a gas would seem to go to zero as the density 
goes to zero, but this is not so, because the mean free path of atoms becomes large at low densities. A subsequent 
experiment by Maxwell and his wife confirmed this surprising prediction. Other experiments on gases and vacuum, 
using a rotating slitted drum, showed that atoms in a gas had velocities distributed according to Maxwell's 
distribution law. 

In addition to these successes, there were also inconsistencies. Maxwell noted that at cold temperatures, atomic 
theory predicted specific heats that are too large. In classical statistical mechanics, every spring-like motion has 
thermal energy kT on average at temperature T, so that the specific heat of every spring is Boltzmann's constant & . 
A monatomic solid with N atoms can be thought of as N little balls representing N atoms attached to each other in a 
box grid with 3N springs, so the specific heat of every solid is 3Nk , a result which became known as the 
Dulong-Petit law. This law is true at room temperature, but not for colder temperatures. At temperatures near zero, 
the specific heat goes to zero. 

Similarly, a gas made up of a molecule with two atoms can be thought of as two balls on a spring. This spring has 
energy k^T at high temperatures, and should contribute an extra & B to the specific heat. It does at temperatures of 
about 1000 degrees, but at lower temperature, this contribution disappears. At zero temperature, all other 
contributions to the specific heat from rotations and vibrations also disappear. This behavior was inconsistent with 
classical physics. 

The most glaring inconsistency was in the theory of light waves. Continuous waves in a box can be thought of as 
infinitely many spring-like motions, one for each possible standing wave. Each standing wave has a specific heat of 
Al,, so the total specific heat of a continuous wave like light should be infinite in classical mechanics. This is 
obviously wrong, because it would mean that all energy in the universe would be instantly sucked up into light 
waves, and everything would slow down and stop. 

These inconsistencies led some people to say that atoms were not physical, but mathematical. Notable among the 
skeptics was Ernst Mach, whose positivist philosophy led him to demand that if atoms are real, it should be possible 
to see them directly P 9 ^ Mach believed that atoms were a useful fiction, that in reality they could be assumed to be 
infinitesimally small, that Avogadro's number was infinite, or so large that it might as well be infinite, and was 
infinitesimally small. Certain experiments could then be explained by atomic theory, but other experiments could 
not, and this is the way it will always be. 

Einstein opposed this position. Throughout his career, he was a realist. He believed that a single consistent theory 
should explain all observations, and that this theory would be a description of what was really going on, underneath 
it all. So he set out to show that the atomic point of view was correct. This led him first to thermodynamics, then to 
statistical physics, and to the theory of specific heats of solids. 

In 1905, while he was working in the patent office, the leading German language physics journal Annalen der Physik 
published four of Einstein's papers. The four papers eventually were recognized as revolutionary, and 1905 became 
known as Einstein's "Miracle Year", and the papers as the Annus Mirabilis Papers. 
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Thermodynamic fluctuations and statistical physics 

Einstein's earliest papers were concerned with thermodynamics. He wrote a paper establishing a thermodynamic 
identity in 1902, and a few other papers which attempted to interpret phenomena from a statistical atomic point of 
view. 

His research in 1903 and 1904 was mainly concerned with the effect of finite atomic size on diffusion phenomena. 
As in Maxwell's work, the finite nonzero size of atoms leads to effects which can be observed. This research, and the 
thermodynamic identity, were well within the mainstream of physics in his time. They would eventually form the 
content of his PhD thesis. ^ 

His first major result in this field was the theory of thermodynamic fluctuations. When in equilibrium, a system has a 
maximum entropy and, according to the statistical interpretation, it can fluctuate a little bit. Einstein pointed out that 
the statistical fluctuations of a macroscopic object, like a mirror suspended on spring, would be completely 
determined by the second derivative of the entropy with respect to the position of the mirror. 

Searching for ways to test this relation, his great breakthrough came in 1905. The theory of fluctuations, he realized, 
would have a visible effect for an object which could move around freely. Such an object would have a velocity 
which is random, and would move around randomly, just like an individual atom. The average kinetic energy of the 
object would be kgT, and the time decay of the fluctuations would be entirely determined by the law of friction. 

The law of friction for a small ball in a viscous fluid like water was discovered by George Stokes. He showed that 
for small velocities, the friction force would be proportional to the velocity, and to the radius of the particle (see 
Stokes' law). This relation could be used to calculate how far a small ball in water would travel due to its random 
thermal motion, and Einstein noted that such a ball, of size about a micrometre, would travel about a few 
micrometres per second. This motion could be easily detected with a microscope and indeed, as Brownian motion, 
had actually been observed by the botanist Robert Brown. Einstein was able to identify this motion with that 
predicted by his theory. Since the fluctuations which give rise to Brownian motion are just the same as the 
fluctuations of the velocities of atoms, measuring the precise amount of Brownian motion using Einstein's theory 
would show that Boltzmann's constant is non-zero and would measure Avogadro's number. 

These experiments were carried out a few years later by Jean Baptiste Perrin, and gave a rough estimate of 
Avogadro's number consistent with the more accurate estimates due to Max Planck's theory of blackbody light and 
Robert Millikan's measurement of the charge of the electron J 61 ^ Unlike the other methods, Einstein's required very 
few theoretical assumptions or new physics, since it was directly measuring atomic motion on visible grains. 

Einstein's theory of Brownian motion was the first paper in the field of statistical physics. It established that 
thermodynamic fluctuations were related to dissipation. This was shown by Einstein to be true for time-independent 
fluctuations, but in the Brownian motion paper he showed that dynamical relaxation rates calculated from classical 
mechanics could be used as statistical relaxation rates to derive dynamical diffusion laws. These relations are known 
as Einstein relations. 

The theory of Brownian motion was the least revolutionary of Einstein's Annus mirabilis papers, but it is the most 
frequently cited, and had an important role in securing the acceptance of the atomic theory by physicists. 
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Thought experiments and a-priori physical principles 

Einstein's thinking underwent a transformation in 1905. He had come to understand that quantum properties of light 
mean that Maxwell's equations were only an approximation. He knew that new laws would have to replace these, but 
he did not know how to go about finding those laws. He felt that guessing formal relations would not go anywhere. 

So he decided to focus on a-priori principles instead, which are statements about physical laws which can be 
understood to hold in a very broad sense even in domains where they have not yet been shown to apply. A well 
accepted example of an a-priori principle is rotational invariance. If a new force is discovered in physics, it is 
assumed to be rotationally invariant almost automatically, without thought. Einstein sought new principles of this 
sort, to guide the production of physical ideas. Once enough principles are found, then the new physics will be the 
simplest theory consistent with the principles and with previously known laws. 

The first general a-priori principle he found was the principle of relativity, that uniform motion is indistinguishable 
from rest. This was understood by Hermann Minkowski to be a generalization of rotational invariance from space to 
space-time. Other principles postulated by Einstein and later vindicated are the principle of equivalence and the 
principle of adiabatic invariance of the quantum number. Another of Einstein's general principles, Mach's principle, 
is fiercely debated, and whether it holds in our world or not is still not definitively established. 

The use of a-priori principles is a distinctive unique signature of Einstein's early work, and has become a standard 
tool in modern theoretical physics. 

Special relativity 

His 1905 paper on the electrodynamics of moving bodies introduced his theory of special relativity, which showed 
that the observed independence of the speed of light on the observer's state of motion required fundamental changes 
to the notion of simultaneity. Consequences of this include the time- space frame of a moving body slowing down 
and contracting (in the direction of motion) relative to the frame of the observer. This paper also argued that the idea 
of a luminiferous aether - one of the leading theoretical entities in physics at the time - was superfluous J 62 ^ In his 
paper on mass-energy equivalence, which had previously been considered to be distinct concepts, Einstein deduced 
from his equations of special relativity what has been called the 20th century's best-known equation: E = mc 2 } 63 ^ t64] 
This equation suggests that tiny amounts of mass could be converted into huge amounts of energy and presaged the 
development of nuclear power. ^ Einstein's 1905 work on relativity remained controversial for many years, but was 
accepted by leading physicists, starting with Max Planck. ^ ^ 

Photons 

In a 1905 paper, ^ Einstein postulated that light itself consists of localized particles (quanta). Einstein's light quanta 
were nearly universally rejected by all physicists, including Max Planck and Niels Bohr. This idea only became 
universally accepted in 1919, with Robert Millikan's detailed experiments on the photoelectric effect, and with the 
measurement of Compton scattering. 

Einstein's paper on the light particles was almost entirely motivated by thermodynamic considerations. He was not at 
all motivated by the detailed experiments on the photoelectric effect, which did not confirm his theory until fifteen 
years later. Einstein considers the entropy of light at temperature T, and decomposes it into a low-frequency part and 
a high-frequency part. The high-frequency part, where the light is described by Wien's law, has an entropy which 
looks exactly the same as the entropy of a gas of classical particles. 

Since the entropy is the logarithm of the number of possible states, Einstein concludes that the number of states of 
short wavelength light waves in a box with volume V is equal to the number of states of a group of localizable 
particles in the same box. Since (unlike others) he was comfortable with the statistical interpretation, he confidently 
postulates that the light itself is made up of localized particles, as this is the only reasonable interpretation of the 
entropy. 
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This leads him to conclude that each wave of frequency / is associated with a collection of photons with energy hf 
each, where h is Planck's constant. He does not say much more, because he is not sure how the particles are related to 
the wave. But he does suggest that this idea would explain certain experimental results, notably the photoelectric 
effect. 1691 

Quantized atomic vibrations 

Einstein continued his work on quantum mechanics in 1906, by explaining the specific heat anomaly in solids. This 
was the first application of quantum theory to a mechanical system. Since Planck's distribution for light oscillators 
had no problem with infinite specific heats, the same idea could be applied to solids to fix the specific heat problem 
there. Einstein showed in a simple model that the hypothesis that solid motion is quantized explains why the specific 
heat of a solid goes to zero at zero temperature. 

Einstein's model treats each atom as connected to a single spring. Instead of connecting all the atoms to each other, 
which leads to standing waves with all sorts of different frequencies, Einstein imagined that each atom was attached 
to a fixed point in space by a spring. This is not physically correct, but it still predicts that the specific heat is 3Nk, 
since the number of independent oscillations stays the same. 

Einstein then assumes that the motion in this model is quantized, according to the Planck law, so that each 
independent spring motion has energy which is an integer multiple of hf, where f is the frequency of oscillation. 
With this assumption, he applied Boltzmann's statistical method to calculate the average energy of the spring. The 
result was the same as the one that Planck had derived for light: for temperatures where k^Tis much smaller than hf 
the motion is frozen, and the specific heat goes to zero. 

So Einstein concluded that quantum mechanics would solve the main problem of classical physics, the specific heat 
anomaly. The particles of sound implied by this formulation are now called phonons. Because all of Einstein's 
springs have the same stiffness, they all freeze out at the same temperature, and this leads to a prediction that the 
specific heat should go to zero exponentially fast when the temperature is low. The solution to this problem is to 
solve for the independent normal modes individually, and to quantize those. Then each normal mode has a different 
frequency, and long wavelength vibration modes freeze out at colder temperatures than short wavelength ones. This 
was done by Peter Debye, and after this modification Einstein's quantization method reproduced quantitatively the 
behavior of the specific heats of solids at low temperatures. 

This work was the foundation of condensed matter physics. 

Adiabatic principle and action-angle variables 

Throughout the 1910s, quantum mechanics expanded in scope to cover many different systems. After Ernest 
Rutherford discovered the nucleus and proposed that electrons orbit like planets, Niels Bohr was able to show that 
the same quantum mechanical postulates introduced by Planck and developed by Einstein would explain the discrete 
motion of electrons in atoms, and the periodic table of the elements. 

Einstein contributed to these developments by linking them with the 1898 arguments Wilhelm Wien had made. Wien 
had shown that the hypothesis of adiabatic invariance of a thermal equilibrium state allows all the blackbody curves 
at different temperature to be derived from one another by a simple shifting process. Einstein noted in 1911 that the 
same adiabatic principle shows that the quantity which is quantized in any mechanical motion must be an adiabatic 
invariant. Arnold Sommerfeld identified this adiabatic invariant as the action variable of classical mechanics. The 
law that the action variable is quantized was the basic principle of the quantum theory as it was known between 1900 
and 1925. 
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Wave-particle duality 

Although the patent office promoted Einstein to Technical Examiner Second Class in 1906, he had not given up on 
academia. In 1908, he became a privatdozent at the University of BernJ 70] In "iiber die Entwicklung unserer 
Anschauungen iiber das Wesen und die Konstitution der Strahlung" ("The Development of Our Views on the 
Composition and Essence of Radiation"), on the quantization of light, and in an earlier 1909 paper, Einstein showed 
that Max Planck's energy quanta must have well-defined momenta and act in some respects as independent, 
point-like particles. This paper introduced the photon concept (although the name photon was introduced later by 
Gilbert N. Lewis in 1926) and inspired the notion of wave-particle duality in quantum mechanics. 

Theory of critical opalescence 

Einstein returned to the problem of thermodynamic fluctuations, giving a treatment of the density variations in a 
fluid at its critical point. Ordinarily the density fluctuations are controlled by the second derivative of the free energy 
with respect to the density. At the critical point, this derivative is zero, leading to large fluctuations. The effect of 
density fluctuations is that light of all wavelengths is scattered, making the fluid look milky white. Einstein relates 
this to Raleigh scattering, which is what happens when the fluctuation size is much smaller than the wavelength, and 
which explains why the sky is blue. 



Zero-point energy 

Einstein's physical intuition led him to note that Planck's oscillator 
energies had an incorrect zero point. He modified Planck's hypothesis 
by stating that the lowest energy state of an oscillator is equal to V^/, to 
half the energy spacing between levels. This argument, which was made 
in 1913 in collaboration with Otto Stern, was based on the 
thermodynamics of a diatomic molecule which can split apart into two 
free atoms. 

Principle of equivalence 

In 1907, while still working at the patent office, Einstein had what he 
would call his "happiest thought". He realized that the principle of 
relativity could be extended to gravitational fields. He thought about the 
case of a uniformly accelerated box not in a gravitational field, and 
noted that it would be indistinguishable from a box sitting still in an 
unchanging gravitational field. He used special relativity to see that 
the rate of clocks at the top of a box accelerating upward would be 
faster than the rate of clocks at the bottom. He concludes that the rates 
of clocks depend on their position in a gravitational field, and that the 
difference in rate is proportional to the gravitational potential to first 
approximation. 

Although this approximation is crude, it allowed him to calculate the deflection of light by gravity, and show that it 
is nonzero. This gave him confidence that the scalar theory of gravity proposed by Gunnar Nordstrom was incorrect. 
But the actual value for the deflection that he calculated was too small by a factor of two, because the approximation 
he used doesn't work well for things moving at near the speed of light. When Einstein finished the full theory of 
general relativity, he would rectify this error and predict the correct amount of light deflection by the sun. 

From Prague, Einstein published a paper about the effects of gravity on light, specifically the gravitational redshift 
and the gravitational deflection of light. The paper challenged astronomers to detect the deflection during a solar 




Albert Einstein 



628 



eclipse. German astronomer Erwin Finlay-Freundlich publicized Einstein's challenge to scientists around the 
world. [74] 

Einstein thought about the nature of the gravitational field in the years 1909-1912, studying its properties by means 
of simple thought experiments. A notable one is the rotating disk. Einstein imagined an observer making 
experiments on a rotating turntable. He noted that such an observer would find a different value for the mathematical 
constant pi than the one predicted by Euclidean geometry. The reason is that the radius of a circle would be 
measured with an uncontracted ruler, but, according to special relativity, the circumference would seem to be longer 
because the ruler would be contracted. 

Since Einstein believed that the laws of physics were local, described by local fields, he concluded from this that 
spacetime could be locally curved. This led him to study Riemannian geometry, and to formulate general relativity in 
this language. 

Hole argument and Entwurf theory 

While developing general relativity, Einstein became confused about the gauge in variance in the theory. He 
formulated an argument that led him to conclude that a general relativistic field theory is impossible. He gave up 
looking for fully generally co variant tensor equations, and searched for equations that would be invariant under 
general linear transformations only. 

In June, 1913 the Entwurf ("draft") theory was the result of these investigations. As its name suggests, it was a 
sketch of a theory, with the equations of motion supplemented by additional gauge fixing conditions. Simultaneously 
less elegant and more difficult than general relativity, after more than two years of intensive work Einstein 
abandoned the theory in November, 1915 after realizing that the hole argument was mistaken. 

General relativity 

In 1912, Einstein returned to Switzerland to accept a professorship at his alma mater, the ETH. Once back in Zurich, 
he immediately visited his old ETH classmate Marcel Grossmann, now a professor of mathematics, who introduced 
him to Riemannian geometry and, more generally, to differential geometry. On the recommendation of Italian 
mathematician Tullio Levi-Civita, Einstein began exploring the usefulness of general covariance (essentially the use 
of tensors) for his gravitational theory. For a while Einstein thought that there were problems with the approach, but 
he later returned to it and, by late 1915, had published his general theory of relativity in the form in which it is used 
today F 6 ^ This theory explains gravitation as distortion of the structure of spacetime by matter, affecting the inertial 
motion of other matter. During World War I, the work of Central Powers scientists was available only to Central 
Powers academics, for national security reasons. Some of Einstein's work did reach the United Kingdom and the 
United States through the efforts of the Austrian Paul Ehrenfest and physicists in the Netherlands, especially 1902 
Nobel Prize-winner Hendrik Lorentz and Willem de Sitter of Leiden University. After the war ended, Einstein 
maintained his relationship with Leiden University, accepting a contract as an Extraordinary Professor, for ten 
years, from 1920 to 1930, he travelled to Holland regularly to lecture. 

In 1917, several astronomers accepted Einstein 's 1911 challenge from Prague. The Mount Wilson Observatory in 
California, U.S., published a solar spectroscopic analysis that showed no gravitational redshift.' 78 ^ In 1918, the Lick 
Observatory, also in California, announced that it too had disproved Einstein's prediction, although its findings were 
not published. [79] 
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Eddington's photograph of a solar eclipse, which 
confirmed Einstein's theory that light "bends". 



movement. 



[83] [84] 



However, in May 1919, a team led by the British astronomer Arthur 
Stanley Eddington claimed to have confirmed Einstein's prediction of 
gravitational deflection of starlight by the Sun while photographing a 
solar eclipse with dual expeditions in Sobral, northern Brazil, and 
Principe, a west African island Nobel laureate Max Born praised 
general relativity as the "greatest feat of human thinking about 
nature" fellow laureate Paul Dirac was quoted saying it was 

roil 

"probably the greatest scientific discovery ever made". L ° 1J The 
international media guaranteed Einstein's global renown. 

There have been claims that scrutiny of the specific photographs taken 

on the Eddington expedition showed the experimental uncertainty to be 

comparable to the same magnitude as the effect Eddington claimed to 

have demonstrated, and that a 1962 British expedition concluded that 

[371 

the method was inherently unreliable. The deflection of light during 
a solar eclipse was confirmed by later, more accurate observations J 82 ^ 
Some resented the newcomer's fame, notably among some German 
physicists, who later started the Deutsche Physik (German Physics) 



Cosmology 

In 1917, Einstein applied the General theory of relativity to model the structure of the universe as a whole. He 
wanted the universe to be eternal and unchanging, but this type of universe is not consistent with relativity. To fix 
this, Einstein modified the general theory by introducing a new notion, the cosmological constant. With a positive 
cosmological constant, the universe could be an eternal static sphere^ 85 ^ 

Einstein believed a spherical static universe is philosophically preferred, because it would obey Mach's principle. He 
had shown that general relativity incorporates Mach's principle to a certain extent in frame dragging by 
gravitomagnetic fields, but he knew that Mach's idea would not work if space goes on forever. In a closed universe, 
he believed that Mach's principle would hold. 

Mach's principle has generated much controversy over the years. 
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Modern quantum theory 

In 1917, at the height of his work on relativity, Einstein published an 
article in Physikalische Zeitschrift that proposed the possibility of 
stimulated emission, the physical process that makes possible the 
maser and the laser J 86 ^ This article showed that the statistics of 
absorption and emission of light would only be consistent with 
Planck's distribution law if the emission of light into a mode with n 
photons would be enhanced statistically compared to the emission of 
light into an empty mode. This paper was enormously influential in the 
later development of quantum mechanics, because it was the first paper 
to show that the statistics of atomic transitions had simple laws. 
Einstein discovered Louis de Broglie's work, and supported his ideas, 
which were received skeptically at first. In another major paper from 
this era, Einstein gave a wave equation for de Broglie waves, which 
Einstein suggested was the Hamilton-Jacobi equation of mechanics. 
This paper would inspire Schrodinger's work of 1926. 




Einstein in his office at the University of Berlin. 



Bose-Einstein statistics 

In 1924, Einstein received a description of a statistical model from Indian physicist Satyendra Nath Bose, based on a 
counting method that assumed that light could be understood as a gas of indistinguishable particles. Einstein noted 
that Bose's statistics applied to some atoms as well as to the proposed light particles, and submitted his translation of 
Bose's paper to the Zeitschrift filr Physik. Einstein also published his own articles describing the model and its 
implications, among them the Bose-Einstein condensate phenomenon that some particulates should appear at very 

TR71 

low temperatures. It was not until 1995 that the first such condensate was produced experimentally by Eric Allin 
Cornell and Carl Wieman using ultra-cooling equipment built at the NIST-JILA laboratory at the University of 
Colorado at Boulder J 88 ^ Bose-Einstein statistics are now used to describe the behaviors of any assembly of bosons. 
Einstein's sketches for this project may be seen in the Einstein Archive in the library of the Leiden University.^ 

Energy momentum pseudotensor 

General relativity includes a dynamical spacetime, so it is difficult to see how to identify the conserved energy and 
momentum. Noether's theorem allows these quantities to be determined from a Lagrangian with translation 
invariance, but general covariance makes translation invariance into something of a gauge symmetry. The energy 
and momentum derived within general relativity by Noether's presecriptions do not make a real tensor for this 
reason. 

Einstein argued that this is true for fundamental reasons, because the gravitational field could be made to vanish by a 
choice of coordinates. He maintained that the non-covariant energy momentum pseudotensor was in fact the best 
description of the energy momentum distribution in a gravitational field. This approach has been echoed by Lev 
Landau and Evgeny Lifshitz, and others, and has become standard. 

The use of non-covariant objects like pseudotensors was heavily criticized in 1917 by Erwin Schrodinger and others. 
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Unified field theory 

Following his research on general relativity, Einstein entered into a series of attempts to generalize his geometric 
theory of gravitation, which would allow the explanation of electromagnetism. In 1950, he described his "unified 
field theory" in a Scientific American article entitled "On the Generalized Theory of Gravitation".^ Although he 
continued to be lauded for his work, Einstein became increasingly isolated in his research, and his efforts were 
ultimately unsuccessful. In his pursuit of a unification of the fundamental forces, Einstein ignored some mainstream 
developments in physics, most notably the strong and weak nuclear forces, which were not well understood until 
many years after his death. Mainstream physics, in turn, largely ignored Einstein's approaches to unification. 
Einstein's dream of unifying other laws of physics with gravity motivates modern quests for a theory of everything 
and in particular string theory, where geometrical fields emerge in a unified quantum-mechanical setting. 

Wormholes 

Einstein collaborated with others to produce a model of a wormhole. His motivation was to model elementary 
particles with charge as a solution of gravitational field equations, in line with the program outlined in the paper "Do 
Gravitational Fields play an Important Role in the Constitution of the Elementary Particles?". These solutions cut 
and pasted Schwarzschild black holes to make a bridge between two patches. 

If one end of a wormhole was positively charged, the other end would be negatively charged. These properties led 
Einstein to believe that pairs of particles and antiparticles could be described in this way. 

Einstein-Cartan theory 

In order to incorporate spinning point particles into general relativity, the affine connection needed to be generalized 
to include an antisymmetric part, called the torsion. This modification was made by Einstein and Cartan in the 1920s. 

Equations of motion 

The theory of general relativity has a fundamental law - the Einstein equations which describe how space curves, 
the geodesic equation which describes how particles move may be derived from the Einstein equations. 

Since the equations of general relativity are non-linear, a lump of energy made out of pure gravitational fields, like a 
black hole, would move on a trajectory which is determined by the Einstein equations themselves, not by a new law. 
So Einstein proposed that the path of a singular solution, like a black hole, would be determined to be a geodesic 
from general relativity itself. 

This was established by Einstein, Infeld and Hoffmann for pointlike objects without angular momentum, and by Roy 
Kerr for spinning objects. 

Einstein's controversial beliefs in physics 

In addition to his well-accepted results, some of Einstein's views are regarded as controversial: 

• In the special relativity paper (in 1905), Einstein noted that, given a specific definition of the word "force" (a 
definition which he later agreed was not advantageous), and if we choose to maintain (by convention) the 
equation mass x acceleration = force, then one arrives at m/(l-i; 2 /c 2 )as the expression for the transverse mass of 
a fast moving particle. This differs from the accepted expression today, because, as noted in the footnotes to 
Einstein's paper added in the 1913 reprint, "it is more to the point to define force in such a way that the laws of 
energy and momentum assume the simplest form", as was done, for example, by Max Planck in 1906, who gave 
the now familiar expression m/^/l-v 2 /c 2 for the transverse mass. As Miller points out, this is equivalent to the 
transverse mass predictions of both Einstein and Lorentz. Einstein had commented already in the 1905 paper that 
"With a different definition of force and acceleration, we should naturally obtain other expressions for the masses. 
This shows that in comparing different theories... we must proceed very cautiously." t90] 
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• Einstein published (in 1922) a qualitative theory of superconductivity based on the vague idea of electrons shared 
in orbits. This paper predated modern quantum mechanics, and today is regarded as being incorrect. The current 
theory of low temperature superconductivity was only worked out in 1957, thirty years after the establishing of 
modern quantum mechanics. However, even today, superconductivity is not well understood, and alternative 
theories continue to be put forward, especially to account for high- temperature superconductors. 

• After introducing the concept of gravitational waves in 1917, Einstein subsequently entertained doubts about 
whether they could be physically realized. In 1937 he published a paper saying that the focusing properties of 
geodesies in general relativity would lead to an instability which causes plane gravitational waves to collapse in 
on themselves. While this is true to a certain extent in some limits, because gravitational instabilities can lead to a 
concentration of energy density into black holes, for plane waves of the type Einstein and Rosen considered in 
their paper, the instabilities are under control. Einstein retracted this position a short time later. 

• Einstein denied several times that black holes could form. In 1939 he published a paper that argues that a star 
collapsing would spin faster and faster, spinning at the speed of light with infinite energy well before the point 
where it is about to collapse into a black hole. This paper received no citations, and the conclusions are well 
understood to be wrong. Einstein's argument itself is inconclusive, since he only shows that stable spinning 
objects have to spin faster and faster to stay stable before the point where they collapse. But it is well understood 
today (and was understood well by some even then) that collapse cannot happen through stationary states the way 
Einstein imagined. Nevertheless, the extent to which the models of black holes in classical general relativity 
correspond to physical reality remains unclear, and in particular the implications of the central singularity implicit 
in these models are still not understood. Efforts to conclusively prove the existence of event horizons have still 
not been successful. 

• Closely related to his rejection of black holes, Einstein believed that the exclusion of singularities might restrict 
the class of solutions of the field equations so as to force solutions compatible with quantum mechanics, but no 
such theory has ever been found. 

• In the early days of quantum mechanics, Einstein tried to show that the uncertainty principle was not valid, but by 
1927 he had become convinced that it was valid. 

• In the EPR paper, Einstein argued that quantum mechanics cannot be a complete realistic and local representation 
of phenomena, given specific definitions of "realism", "locality", and "completeness". The modern consensus is 
that Einstein's concept of realism is too restrictive. 

• Einstein himself considered the introduction of the cosmological term in his 1917 paper founding cosmology as a 
"blunder". The theory of general relativity predicted an expanding or contracting universe, but Einstein wanted 
a universe which is an unchanging three dimensional sphere, like the surface of a three dimensional ball in four 
dimensions. He wanted this for philosophical reasons, so as to incorporate Mach's principle in a reasonable way. 
He stabilized his solution by introducing a cosmological constant, and when the universe was shown to be 
expanding, he retracted the constant as a blunder. This is not really much of a blunder - the cosmological constant 
is necessary within general relativity as it is currently understood, and it is widely believed to have a nonzero 
value today. 

• Einstein did not immediately appreciate the value of Minkowski's four-dimensional formulation of special 
relativity, although within a few years he had adopted it as the basis for his theory of gravitation. 

• Finding it too formal, Einstein believed that Heisenberg's matrix mechanics was incorrect. He changed his mind 
when Schrodinger and others demonstrated that the formulation in terms of the Schrodinger equation, based on 
Einstein's wave-particle duality was equivalent to Heisenberg's matrices. 
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Collaboration with other scientists 

In addition to long time collaborators Leopold Infeld, Nathan Rosen, Peter Bergmann and others, Einstein also had 
some one-shot collaborations with various scientists. 

Einstein-de Haas experiment 

Einstein and De Haas demonstrated that magnetization is due to the motion of electrons, nowadays known to be the 
spin. In order to show this, they reversed the magnetization in an iron bar suspended on a torsion pendulum. They 
confirmed that this leads the bar to rotate, because the electron's angular momentum changes as the magnetization 
changes. This experiment needed to be sensitive, because the angular momentum associated with electrons is small, 
but it definitively established that electron motion of some kind is responsible for magnetization. 

Schrodinger gas model 

Einstein suggested to Erwin Schrodinger that he might be able to reproduce the statistics of a Bose-Einstein gas by 
considering a box. Then to each possible quantum motion of a particle in a box associate an independent harmonic 
oscillator. Quantizing these oscillators, each level will have an integer occupation number, which will be the number 
of particles in it. 

This formulation is a form of second quantization, but it predates modern quantum mechanics. Erwin Schrodinger 
applied this to derive the thermodynamic properties of a semiclassical ideal gas. Schrodinger urged Einstein to add 
his name as co-author, although Einstein declined the invitation. 1 

Einstein refrigerator 

In 1926, Einstein and his former student Leo Szilard co-invented (and in 1930, patented) the Einstein refrigerator. 
This absorption refrigerator was then revolutionary for having no moving parts and using only heat as an input. 
On 11 November 1930, U.S. Patent 1781541 t94] was awarded to Albert Einstein and Leo Szilard for the refrigerator. 
Their invention was not immediately put into commercial production, as the most promising of their patents were 
quickly bought up by the Swedish company Electrolux to protect its refrigeration technology from competition J 94 -' 

Bohr versus Einstein 

In the 1920s, quantum mechanics developed into a more complete theory. 
Einstein was unhappy with the Copenhagen interpretation of quantum theory 
developed by Niels Bohr and Werner Heisenberg, both in its outcomes and its 
instrumentalist methodology, Einstein being a scientific realist. In this 
interpretation, quantum phenomena are inherently probabilistic, with definite 
states resulting only upon interaction with classical systems. A public debate 
between Einstein and Bohr followed, lasting on and off for many years 
(including during the Solvay Conferences). Einstein formulated thought 
experiments against the Copenhagen interpretation, which were all rebutted by 
Bohr. In a 1926 letter to Max Born, Einstein wrote: "I, at any rate, am convinced 
that He [God] does not throw dice." [95] 

Einstein was never satisfied by what he perceived to be quantum theory's 
intrinsically incomplete description of nature, and in 1935 he further explored the 
issue in collaboration with Boris Podolsky and Nathan Rosen, noting that the 

theory seems to require non-local interactions; this is known as the EPR paradox P 6 ^ The EPR experiment has since 
been performed, with results confirming quantum theory's predictions. Repercussions of the Einstein-Bohr 
debate have found their way into philosophical discourse. 




Einstein and Niels Bohr, 1925 
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Einstein-Podolsky-Rosen paradox 

In 1935, Einstein returned to the question of quantum mechanics. He considered how a measurement on one of two 
entangled particles would affect the other. He noted, along with his collaborators, that by performing different 
measurements on the distant particle, either of position or momentum, different properties of the entangled partner 
could be discovered without disturbing it in any way. 

He then used a hypothesis of local realism to conclude that the other particle had these properties already 
determined. The principle he proposed is that if it is possible to determine what the answer to a position or 
momentum measurement would be, without in any way disturbing the particle, then the particle actually has values 
of position or momentum. 

This principle distilled the essence of Einstein's objection to quantum mechanics. As a physical principle, it has since 
been shown to be incompatible with experiments. 



Political views 



Einstein flouted the ascendant Nazi movement and later tried to be a 
voice of moderation in the tumultuous formation of the State of 
Israel J 98 ^ Fred Jerome in his Einstein on Israel and Zionism: His 
Provocative Ideas About the Middle East argues that Einstein was a 
Cultural Zionist who supported the idea of a Jewish homeland but 
opposed the establishment of a Jewish state in Palestine "with borders, 
an army, and a measure of temporal power." Instead, he preferred a 
bi-national state with "continuously functioning, mixed, administrative, 
economic, and social organizations."^ 99 ^ ^ 100 ^ However Ami Isseroff in 
his article Was Einstein a Zionist, argues that Einstein supported the 
recognition of the State of Israel and declared it "the fulfillment of our 
dream" when President Harry Truman recognize Israel in May 1948 
and in presidential election 1948 Einstein supported Henry A. 
Wallace's Progressive Party which advocate pro-Soviet and pro-Israel 
foreign policy 




[101] [102] 



Albert Einstein, seen here with his wife Elsa 
Einstein and Zionist leaders, including future 
President of Israel Chaim Weizmann, his wife Dr. 

Vera Weizmann, Menahem Ussishkin, and 
Ben-Zion Mossinson on arrival in New York City 
in 1921. 



Throughout the November Revolution in Germany Einstein signed an appeal for the foundation of a nationwide 
liberal and democratic party, ^ 103 ^ ^ 104 ^ which was published in the Berliner Tageblatt on 16 November 1918,^ 105 ^ and 
became a member of the German Democratic Party J 106 ^ 

In his article Why Socialism?,^ 107 ' published in 1949 in the Monthly Review, Einstein described a chaotic capitalist 
society, a source of evil to be overcome, as the "predatory phase of human development". He came to the following 
conclusion: 

I am convinced there is only one way to eliminate these grave evils [capitalism], namely through the 
establishment of a socialist economy, accompanied by an educational system which would be oriented 
toward social goals. In such an economy, the means of production are owned by society itself and are 
utilized in a planned fashion. A planned economy, which adjusts production to the needs of the 
community, would distribute the work to be done among all those able to work and would guarantee a 
livelihood to every man, woman, and child. The education of the individual, in addition to promoting his 
own innate abilities, would attempt to develop in him a sense of responsibility for his fellow men in 
place of the glorification of power and success in our present society. 11071 

He braved anti-communist politics and resistance to the civil rights movement in the United States. On the floor of 
the US Congress, Einstein was accused by John E. Rankin of Mississippi of being a "foreign-born agitator" who 
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sought "to further the spread of Communism throughout the world" J 108 ^ He also participated in the 1927 congress of 
the League against Imperialism in Brussels. tl09] 

After World War II, as enmity between the former allies became a serious issue, Einstein wrote, "I do not know how 
the third World War will be fought, but I can tell you what they will use in the Fourth - rocks !" [110] (Einstein 1949) 
With Albert Schweitzer and Bertrand Russell, Einstein lobbied to stop nuclear testing and future bombs. Days before 
his death, Einstein signed the Russell-Einstein Manifesto, which led to the Pugwash Conferences on Science and 
World Affairs. [111] 

Einstein was a member of several civil rights groups, including the Princeton chapter of the NAACP. When the aged 
W. E. B. Du Bois was accused of being a Communist spy, Einstein volunteered as a character witness, and the case 
was dismissed shortly afterward. Einstein's friendship with activist Paul Robeson, with whom he served as co-chair 
of the American Crusade to End Lynching, lasted twenty years P 12 ^ 

Einstein said "Politics is for the moment, equation for the eternity."^ 1 He declined the presidency of Israel in 
1952. [114] 



Religious views 

The question of scientific determinism gave rise to questions about Einstein's position on theological determinism, 
and whether or not he believed in God, or in a god. He once said: 

You may call me an agnostic... I do not share the crusading spirit of the professional atheist whose fervor is 
mostly due to a painful act of liberation from the fetters of religious indoctrination received in youth. I prefer 
an attitude of humility corresponding to the weakness of our intellectual understanding of nature and of our 
own beingP~^ 



Non-scientific legacy 

While travelling, Einstein wrote daily to his wife Elsa and adopted stepdaughters Margot and Use. The letters were 
included in the papers bequeathed to The Hebrew University. Margot Einstein permitted the personal letters to be 
made available to the public, but requested that it not be done until twenty years after her death (she died in 1986^ 
). Barbara Wolff, of The Hebrew University's Albert Einstein Archives, told the BBC that there are about 3,500 
pages of private correspondence written between 1912 and 1955 P 

Einstein bequeathed the royalties from use of his image to The Hebrew University of Jerusalem. Corbis, successor to 

The Roger Richman Agency, licenses the use of his name and associated imagery, as agent for the university/ 1 18] 
[119] 



In popular culture 

In the period before World War II, Einstein was so well-known in America that he would be stopped on the street by 
people wanting him to explain "that theory". He finally figured out a way to handle the incessant inquiries. He told 
his inquirers "Pardon me, sorry! Always I am mistaken for Professor Einstein. M ^ 120 ^ 

Einstein has been the subject of or inspiration for many novels, films, plays, and works of music J 121 ^ He is a favorite 
model for depictions of mad scientists and absent-minded professors; his expressive face and distinctive hairstyle 
have been widely copied and exaggerated. Time magazine's Frederic Golden wrote that Einstein was "a cartoonist's 
dream come true"J 122] 
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Awards and honors 

n 23] 

In 1922, Einstein was awarded the 1921 Nobel Prize in Physics, 
"for his services to Theoretical Physics, and especially for his 
discovery of the law of the photoelectric effect". This refers to his 1905 
paper on the photoelectric effect, "On a Heuristic Viewpoint 
Concerning the Production and Transformation of Light", which was 
well supported by the experimental evidence by that time. The 
presentation speech began by mentioning "his theory of relativity 
[which had] been the subject of lively debate in philosophical circles 
[and] also has astrophysical implications which are being rigorously 
examined at the present time". (Einstein 1923) 

It was long reported that Einstein gave the Nobel prize money to his 
first wife, Mileva Marie, in compliance with their 1919 divorce 
settlement. However, personal correspondence made public in 
2006^ 124 ^ shows that he invested much of it in the United States, and 
saw much of it wiped out in the Great Depression. 




In 1999 Albert Einstein was named the Person of 
the Century. 



In 1929, Max Planck presented Einstein with the Max Planck medal of 
the German Physical Society in Berlin, for extraordinary achievements 
in theoretical physics J 125 ^ 

In 1936, Einstein was awarded the Franklin Institute's Franklin Medal for his extensive work on relativity and the 
photo-electric effect. 



[125] 



The International Union of Pure and Applied Physics named 2005 the "World Year of Physics" in commemoration 
of the 100th anniversary of the publication of the annus mirabilis papers J- 126 ^ 

The Albert Einstein Science Park is located on the hill Telegrafenberg in Potsdam, Germany. The best known 
building in the park is the Einstein Tower which has a bronze bust of Einstein at the entrance. The Tower is an 

n 27] 

astrophysical observatory that was built to perform checks of Einstein's theory of General Relativity. 

The Albert Einstein Memorial in central Washington, D.C. is a monumental bronze statue depicting Einstein seated 
with manuscript papers in hand. The statue, commissioned in 1979, is located in a grove of trees at the southwest 
corner of the grounds of the National Academy of Sciences on Constitution Avenue. 

n 28i 

The chemical element 99, einsteinium, was named for him in August 1955, four months after Einstein's death. 

ri29i ri30i 
2001 Einstein is an inner main belt asteroid discovered on 5 March 1973. 

[1221 ri3ii 

In 1999 Time magazine named him the Person of the Century, 1 ahead of Mahatma Gandhi and Franklin 

Roosevelt, among others. In the words of a biographer, "to the scientifically literate and the public at large, Einstein 

n 32] 

is synonymous with genius". J Also in 1999, an opinion poll of 100 leading physicists ranked Einstein the 
ri33i 

"greatest physicist ever". 1 A Gallup poll recorded him as the fourth most admired person of the 20th century in 
theU.S. [134] 

In 1990, his name was added to the Walhalla temple for "laudable and distinguished Germans "J 135 ^ which is located 
east of Regensburg, in Bavaria, GermanyJ 136 ^ 

The United States Postal Service honored Einstein with a Prominent Americans series (1965-1978) 80 postage 
stamp. 
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Awards named after him 

The Albert Einstein Award (sometimes called the Albert Einstein Medal because it is accompanied with a gold 
medal) is an award in theoretical physics, established to recognize high achievement in the natural sciences. It was 
endowed by the Lewis and Rosa Strauss Memorial Fund in honor of Albert Einstein's 70th birthday. It was first 
awarded in 1951 and included a prize money of $ 15,000, [137] [138] which was later reduced to $ 5,000. [139] [140] The 
winner is selected by a committee (the first of which consisted of Einstein, Oppenheimer, von Neumann and 
Weyl^ 141 ^ ) of the Institute for Advanced Study, which administers the award J 138 ^ 

The Albert Einstein Medal is an award presented by the Albert Einstein Society in Bern, Switzerland. First given in 
1979, the award is presented to people who have "rendered outstanding services" in connection with Einstein J 142 ^ 

The Albert Einstein Peace Prize is given yearly by the Chicago, Illinois-based Albert Einstein Peace Prize 
Foundation. Winners of the prize receive $50,000 J 143] 



Publications 

The following publications by Albert Einstein are referenced in this article. A more complete list of his 
publications may be found at List of scientific publications by Albert Einstein. 

• Einstein, Albert (1901), "Folgerungen aus den Capillaritatserscheinungen (Conclusions Drawn from the 
Phenomena of Capillarity)", Annalen der Physik 4: 513, doi: 10. 1002/andp. 1901 3090306 

• Einstein, Albert (1905a), "On a Heuristic Viewpoint Concerning the Production and Transformation of Light" 
^ 145 ^, Annalen der Physik 17: 132-148 . This annus mirabilis paper on the photoelectric effect was received by 
Annalen der Physik 18 th March. 

• Einstein, Albert (1905b), A new determination of molecular dimensions. This PhD thesis was completed 30 th 
April and submitted 20 th July. 

• Einstein, Albert (1905c), "On the Motion - Required by the Molecular Kinetic Theory of Heat - of Small 
Particles Suspended in a Stationary Liquid", Annalen der Physik 17: 549-560. This annus mirabilis paper on 
Brownian motion was received 1 1 th May. 

• Einstein, Albert (1905d), "On the Electrodynamics of Moving Bodies", Annalen der Physik 17: 891-921. This 
annus mirabilis paper on special relativity was received 30 th June. 

• Einstein, Albert (1905e), "Does the Inertia of a Body Depend Upon Its Energy Content?", Annalen der Physik 18: 
639-641. This annus mirabilis paper on mass-energy equivalence was received 27 th September. 

• Einstein, Albert (1915), "Die Feldgleichungen der Gravitation (The Field Equations of Gravitation)", Koniglich 
Preussische Akademie der Wissenschaften: 844-847 

• Einstein, Albert (1917a), "Kosmologische Betrachtungen zur allgemeinen Relativitatstheorie (Cosmological 
Considerations in the General Theory of Relativity)", Koniglich Preussische Akademie der Wissenschaften 

• Einstein, Albert (1917b), "Zur Quantentheorie der Strahlung (On the Quantum Mechanics of Radiation)", 
Physikalische Zeitschrift 18: 121-128 

• Einstein, Albert (11 July 1923), "Fundamental Ideas and Problems of the Theory of Relativity" ^ U6 \ Nobel 
Lectures, Physics 1901—1921, Amsterdam: Elsevier Publishing Company, retrieved 25 March 2007 

• Einstein, Albert (1924), "Quantentheorie des einatomigen idealen Gases (Quantum theory of monatomic ideal 
gases)", Sitzungsberichte der P reus sichen Akademie der Wissenschaften Physikalisch-Mathematische Klasse: 
261—267. First of a series of papers on this topic. 

• Einstein, Albert (1926), "Die Ursache der Maanderbildung der Flusslaufe und des sogenannten Baerschen 
Gesetzes", Die Naturwissenschaften 14: 223-224, doi: 10. 1007/BF015 10300. On Baer's law and meanders in the 
courses of rivers. 

• Einstein, Albert; Podolsky, Boris; Rosen, Nathan (15 May 1935), "Can Quantum-Mechanical Description of 
Physical Reality Be Considered Complete?", Physical Review 47 (10): 777-780, doi:10.1103/PhysRev.47.777 
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• Einstein, Albert (1940), "On Science and Religion", Nature (Edinburgh: Scottish Academic) 146: 605, 
doi:10.1038/146605a0, ISBN 0707304539 

• Einstein, Albert et al (4 December 1948), "To the editors" [147] , New York Times (Melville, NY: AIP, American 
Inst, of Physics), ISBN 0735403597 

• Einstein, Albert (May 1949), "Why Socialism?" [148] , Monthly Review, retrieved 16 January 2006 

• Einstein, Albert (1950), "On the Generalized Theory of Gravitation", Scientific American CLXXXII (4): 13-17 

• Einstein, Albert (1954), Ideas and Opinions, New York: Random House, ISBN 0-517-00393-7 

• Einstein, Albert (1969) (in German), Albert Einstein, Hedwig undMax Born: Briefwechsel 1 916-1955, Munich: 
Nymphenburger Verlagshandlung, ISBN 388682005X 

• Einstein, Albert (1979), Autobiographical Notes, Paul Arthur Schilpp (Centennial ed.), Chicago: Open Court, 
ISBN 0-875-48352-6. The chasing a light beam thought experiment is described on pages 48-51. 

• Collected Papers: Stachel, John, Martin J. Klein, a. J. Kox, Michel Janssen, R. Schulmann, Diana Komos 
Buchwald and others (Eds.) (1987-2006), The Collected Papers of Albert Einstein, Vol. I— 10 ^ l49 \ Princeton 
University Press Further information about the volumes published so far can be found on the webpages of the 
Einstein Papers Project ^ 150 ^ and on the Princeton University Press Einstein Page ^ 151 ^ 
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External links 

• Works by Albert Einstein (public domain in Canada) 

• The MacTutor History of Mathematics archive (http://www-history.mcs.st-andrews.ac.uk/Biographies/ 
Einstein.html), School of Mathematics and Statistics, University of St Andrews, Scotland, April 1997, retrieved 
14 June 2009 

• Why Socialism? (http://www.monthlyreview.org/598einstein.php) by Albert Einstein, Monthly Review, May 
1949 

• Nobelprize.org Biography: Albert Einstein (http://nobelprize.org/nobel_prizes/physics/laureates/1921/ 
einstein-bio.html) 

• The Einstein You Never Knew (http://www.life.com/image/first/in-gallery/41492/ 
the-einstein-you-never-knew) - slideshow by Life magazine 

• Albert Einstein (http://www.history.com/topics/albert-einstein)~Watch Videos 

• Science Odyssey People And Discoveries (http://www.pbs.org/wgbh/aso/databank/entries/bpeins.html) 

Authority control: PND: 118529579 (http://d-nb.info/gnd/118529579) I LCCN: n79022889 (http://errol.oclc. 
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Known for 


Born-Haber cycle 
Born rigidity 
Born approximation 
Born-Infeld theory 
Born-Oppenheimer approximation 
Born's Rule 
Born-Lande equation 


Notable awards 


Nobel Prize in Physics (1954) 



Max Born (11 December 1882 - 5 January 1970) was a German born physicist and mathematician who was 
instrumental in the development of quantum mechanics. He also made contributions to solid-state physics and optics 
and supervised the work of a number of notable physicists in the 1920s and 30s. Born won the 1954 Nobel Prize in 
Physics (shared with Walther Bothe). 

Early life and education 

Max was born on December 11, 1882 in Breslau (now Wroclaw, Poland), which at Born's birth was in the Prussian 
Province of Silesia in the German Empire. He was one of two children born to Gustav Born, (b. 22 April 1850, 
Kempen, d. 6 July 1900, Breslau), an anatomist and embryologist, and Margarethe ('Gretchen') Kauffmann (b. 22 
January 1856, Tannhausen, d. 29 August 1886, Breslau), from a Silesian family of industrialists. 

Gustav and Gretchen married on 7 May 1881. She died when Max was just four years old, on 29 August 1886. 

Max had a sister Kathe (b. 5 March 1884), and a half-brother Wolfgang (b. 21 October 1892) from his father's 
second marriage (m. 13 September 1891) with Bertha Lipstein. 

Initially educated at the Konig-Wilhelm-Gymnasium, Born went on to study at the University of Breslau followed by 
Heidelberg University and the University of Zurich. During study for his Ph.D.^ and Habilitation ^ at the 
University of Gottingen, he came into contact with many prominent scientists and mathematicians including Klein, 
Hilbert, Minkowski, Runge, Schwarzschild, and Voigt. In 1908-1909 he studied at Gonville and Caius College, 
Cambridge. 

When Born arrived in Gottingen in 1904, Klein, Hilbert, and Minkowski 1 were the high priests of mathematics and 
were known as the "mandarins." Very quickly after his arrival, Born formed close ties to the latter two men. From the 
first class he took with Hilbert, Hilbert identified Born as having exceptional abilities and selected him as the lecture 
scribe, whose function was to write up the class notes ^ for the students' mathematics reading room at the University 
of Gottingen. Being class scribe put Born into regular, invaluable contact with Hilbert, during which time Hilbert' s 
intellectual largesse benefited Born's fertile mind. Hilbert became Born's mentor and Hilbert eventually selected him 
to be the first to hold the unpaid, semi-official position of Hilbert' s assistant. Born's introduction to Minkowski came 
through Born's stepmother, Bertha, as she knew Minkowski from dancing classes in Konigsberg. The introduction 
netted Born invitations to the Minkowski household for Sunday dinners. In addition, while performing his duties as 
scribe and assistant, Born often saw Minkowski at Hilbert's house. Born's outstanding work on elasticity - a subject 
near and dear to Klein - became the core of his magna cum laude Ph.D. thesis, in spite of some of Born's 
irrationalities in dealing with Klein 

Born married Hedwig, nee Ehrenberg, who was, like Born, of Jewish descent (although a practising Christian), on 2 
August 1913, and converted to the Lutheran faith soon thereafter; the marriage produced three children including G. 
V. R. Born. His daughter Irene was the mother of British-born Australian singer and actress Olivia Newton- John.Via 
marriage, he is the great uncle of British alternative comedian Ben Elton. 
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Career 

After Max's Habilitation in 1909, he settled in as a young academic at Gottingen as a Privatdozent (Associate 
Professor) J 6 ^ In Gottingen, Born stayed at a boarding house run by Sister Annie at DahlmannstraBe 17, known as El 
BoKaReBo The name was derived from the first letters of the last names of its boarders: "El" for Ella Philipson (a 
medical student), "Bo" for Born and Hans Bolza (a physics student), "Ka" for Theodore von Karman (a 
Privatdozent), and "Re" for Albrecht Renner (a medical student). A frequent visitor to the boarding house was Paul 
Peter Ewald, a doctoral student of Arnold Sommerfeld on loan to David Hilbert at Gottingen as a special assistant for 
physics. Richard Courant, a mathematician and Privatdozent, called these people the "in group." 

From 1915 to 1919, except for a period in the German army, Born was extraordinarius professor of theoretical 

physics at the University of Berlin, where he formed a life-long friendship with Albert Einstein. In 1919, he became 

ordinarius professor on the science faculty at the University of Frankfurt am Main. While there, the University of 

Gottingen was looking for a replacement for Peter Debye, and the Philosophy Faculty had Born at the top of their 

list. In negotiating for the position with the education ministry, Born arranged for another chair at Gottingen and for 

his long-time friend and colleague James Franck to fill itP^ In 1921, Born became ordinarius professor of theoretical 

physics and Director of the new Institute of Theoretical Physics at Gottingen J 10 ^ While there, he formulated^ 1 ^ the 

now-standard interpretation of the probability density function for in the Schrodinger equation of quantum 

ri2i 

mechanics, published in July 1926 and for which he was awarded the Nobel Prize in Physics in 1954, some three 
decades later. 

For the 12 years Born and Franck were at Gottingen (1921-1933), Born had a collaborator with shared views on 
basic scientific concepts — a distinct advantage for teaching and his research on the developing quantum theory. The 
approach of close collaboration between theoretical physicists and experimental physicists was also shared by Born 
at Gottingen and Arnold Sommerfeld at the University of Munich, who was ordinarius professor of theoretical 
physics and Director of the Institute of Theoretical Physics — also a prime mover in the development of quantum 
theory. Born and Sommerfeld not only shared their approach in using experimental physics to test and advance their 
theories, Sommerfeld, in 1922 when he was in the United States lecturing at the University of Wisconsin-Madison, 
sent his student Werner Heisenberg to be Born's assistant. Heisenberg again returned to Gottingen in 1923 and 
completed his Habilitation under Born in 1924 and became a Privatdozent at Gottingen - the year before Heisenberg 
and Born published their first papers on matrix mechanics J 13 ^ ^ 

In 1925, Born and Werner Heisenberg formulated the matrix mechanics representation of quantum mechanics. On 9 
July, Heisenberg gave Born a paper to review and submit for publication.^ 15 ^ In the paper, Heisenberg formulated 
quantum theory avoiding the concrete but unobservable representations of electron orbits by using parameters such 
as transition probabilities for quantum jumps, which necessitated using two indexes corresponding to the initial and 

final states J When Born read the paper, he recognized the formulation as one which could be transcribed and 

n 71 n ri 

extended to the systematic language of matrices, which he had learned from his study under Jakob Rosanes at 

Breslau University. Born, with the help of his assistant and former student Pascual Jordan, began immediately to 

make the transcription and extension, and they submitted their results for publication; the paper was received for 

publication just 60 days after Heisenberg's paperJ 19 ^ A follow-on paper was submitted for publication before the end 

of the year by all three authors P 0 ^ (A brief review of Born's role in the development of the matrix mechanics 

formulation of quantum mechanics along with a discussion of the key formula involving the non-commutativity of 

Hi] 

the probability amplitudes can be found in an article by Jeremy Bernstein. A detailed historical and technical 

account can be found in Mehra and Rechenberg's book The Historical Development of Quantum Theory. Volume 3. 

[221 

The Formulation of Matrix Mechanics and Its Modifications 1925—1926. ) 

Up until this time, matrices were seldom used by physicists; they were considered to belong to the realm of pure 
mathematics. Gustav Mie had used them in a paper on electrodynamics in 1912 and Born had used them in his work 
on the lattices theory of crystals in 1921. While matrices were used in these cases, the algebra of matrices with their 
multiplication did not enter the picture as they did in the matrix formulation of quantum mechanics. 
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Born, however, had learned matrix algebra from Rosanes, as already noted, but Born had also learned Hilbert's 
theory of integral equations and quadratic forms for an infinite number of variables as was apparent from a citation 
by Born of Hilbert's work Grundziige einer allgemeinen Theorie der Linear en Integralgleichungen published in 
1912. [24] [25] Jordan, too was well equipped for the task. For a number of years, he had been an assistant to Richard 
Courant at Gottingen in the preparation of Courant and David Hilbert's book Methoden der mathematischen Physik /, 
which was published in 1924.^ This book, fortuitously, contained a great many of the mathematical tools necessary 
for the continued development of quantum mechanics. In 1926, John von Neumann became assistant to David 
Hilbert, and he would coin the term Hilbert space to describe the algebra and analysis which were used in the 

T271 T281 

development of quantum mechanics. 

T291 

In 1928, Albert Einstein nominated Heisenberg, Born, and Jordan for the Nobel Prize in Physics, but it was not to 

be. The announcement of the Nobel Prize in Physics for 1932 was delayed until November 1933.^ It was at that 

time that it was announced Heisenberg had won the Prize for 1932 "for the creation of quantum mechanics, the 

application of which has led to the discovery of the allotropic forms of hydrogen" and Erwin Schrodinger and 

Paul Adrien Maurice Dirac shared the 1933 Prize "for the discovery of new productive forms of atomic theory". 

One can rightly ask why Born was not awarded the Prize in 1932 along with Heisenberg - Bernstein gives some 

speculations on this matter. One of them is related to Jordan joining the Nazi Party on 1 May 1933 and becoming a 

Storm Trooper. Hence, Jordan's Party affiliations and Jordan's links to Born may have affected Born's chance at 

the Prize at that time. Bernstein also notes that when Born won the Prize in 1954, Jordan was still alive, and the Prize 

T331 

was awarded for the statistical interpretation of quantum mechanics, attributable alone to Born. 

Heisenberg' s reaction to Born for Heisenberg himself receiving the Prize for 1932 and Born receiving the Prize in 
1954 is also instructive in evaluating whether Born should have shared the Prize with Heisenberg. On 25 November 
1933 Born received a letter from Heisenberg in which he said he had been delayed in writing due to a "bad 
conscience" that he alone had received the Prize "for work done in Gottingen in collaboration — you, Jordan and I." 
Heisenberg went on to say that Born and Jordan's contribution to quantum mechanics cannot be changed by "a 
wrong decision from the outside. In 1954, Heisenberg wrote an article honoring Max Planck for his insight in 
1900. In the article, Heisenberg credited Born and Jordan for the final mathematical formulation of matrix mechanics 
and Heisenberg went on to stress how great their contributions were to quantum mechanics, which were not 
"adequately acknowledged in the public eye." 

Those who received their Ph.D. degrees under Born at Gottingen included Max Delbriick, Walter Elsasser, Friedrich 
Hund, Pascual Jordan, Maria Goeppert-Mayer, Lothar Wolfgang Nordheim, J. Robert Oppenheimer, and Victor 
WeisskopfJ 36 ^ Born's assistants at the University of Gottingen's Institute for Theoretical Physics included Enrico 
Fermi, Werner Heisenberg, Gerhard Herzberg, Friedrich Hund, Pascual Jordan, Wolfgang Pauli, Leon Rosenfeld, 
Edward Teller, and Eugene WignerJ 36 ^ ^ ^ Walter Heitler became an assistant to Born in 1928 and under Born 
completed his Habilitation in 1929. Born not only recognized talent to work with him, but he let his "superstars 
stretch past him." ^ His Ph.D. student Delbriick, and six of his assistants (Fermi, Heisenberg, Goeppert-Mayer, 
Herzberg, Pauli, Wigner) went on to win Nobel Prizes. 

In a letter to Born in 1926, Einstein made his famous remark regarding quantum mechanics, often paraphrased as 
"The Old One does not play dice." [41] 

In 1933 Born emigrated from Germany. He had strong and public pacifist opinions; moreover, though Born was a 
Lutheran, he was classified as a "Jew" by the Nazi racial laws due to his ancestry, and was thus stripped of his 
professorship. He took up a position as Stokes Lecturer at the University of Cambridge. From 1936 to 1953 he was 
Tait Professor of Natural Philosophy at the University of Edinburgh. He became a British subject and a Fellow of the 
Royal Society of London in 1939. [42] 

Born had a dislike for nuclear weapons research, but he still acknowledged "it might be the only way out."^ 43] Much 
of the theoretical power behind the development of the first atomic bomb was due to many of those surrounding him 
at Gottingen and working on atomic physics and quantum mechanics: three of his Ph.D. students (Maria 
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Goeppert-Mayer, Oppenheimer and Weisskopf), three of his assistants (Fermi, Teller, and Wigner), the Director of 
the Second Institute for Experimental Physics (James Franck), and David Hilbert's assistant (John von Neumann) J 44] 

Max and Hedwig Born retired to Bad Pyrmont (10 km south of Hamelin) in West Germany, in 1954 J 45 ^ 

Born was one of the 11 signatories to the Russell-Einstein Manifesto. 

Born is also the great-grandfather of the famous TV editor and percussionist Kip Thompson-Born. 

Born died in Gottingen, Germany. He is buried there in the same cemetery as Walther Nernst, Wilhelm Weber, Max 
von Laue, Max Planck, and David Hilbert. 



Max Born Prize 

In memory of his important contributions, the Max Born prize was created by the German Physical Society and the 
British Institute of Physics. It is awarded annually. 



Published works 

During his life, Born wrote several semi-popular and technical books. His volumes on topics like atomic physics and 
optics were very well-received and are considered classics in their fields which are still in print. The following is a 
listing of his major works: 

• tjber das Thomson 'sche Atommodell Habilitations-Vortag (FAM, 1909) - The Habilitation was done at the 
University of Gottingen, on 23 October 1909. [46] 

• Dynamik der Kristallgitter (Teubner, 1915) - After its publication, the physicist Arnold Sommerfeld asked 
Born to write an article based on it for the 5th volume of the Mathematical Encyclopedia. World War I delayed 
the start of work on this article, but it was taken up in 1919 and finished in 1922. It was published as a revised 
edition under the title Atomic Theory of Solid States P^ 

• Dynamical Theory of Crystal Lattices, with Kun Huang. (Oxford, Clarendon Press, 1954) ^ 

• Die Relativitatstheorie Einsteins und ihre physikalischen Grundlagen (Springer, 1920) - Based on Born's lectures 
at the University of Frankfurt am Main.'- 50 -' 

• Available in English under the title Einstein 's Theory of Relativity P 1 ^ 

• Vorlesungen iiber Atommechanik (Springer, 1925) 

• Mechanics of the Atom (George Bell & Sons, 1927) - Translated by J. W. Fisher and revised by D. R. 
HartreeJ 52 ^ 

• Problems of Atomic Dynamics (MIT Press, 1926) - A first account of matrix mechanics being developed in 
Germany, based on two series of lectures given at MIT, over three months, in late 1925 and early 1926 P^ ^ 

• Elementare Quantenmechanik (Zweiter Band der Vorlesungen iiber Atommechanik), with Pascual Jordan. 
(Springer, 1930) - This was the first volume of what was intended as a two-volume work. This volume was 
limited to the work Born did with Jordan on matrix mechanics. The second volume was to deal with Erwin 
Schrodinger's wave mechanics. However, the second volume was not even started by Born, as he believed his 
friend and colleague Hermann Weyl had written it before he could do soP 5 ^ ^ 

• Optik: Ein Lehrbuch der elektromagnetische Lichttheorie (Springer, 1933) - The book was released just as the 
Borns were emigrating to England. 

T571 

• Principles of Optics: Electromagnetic Theory of Propagation, Interference and Diffraction of Light, with 
Emil Wolf. (Pergamon, 1959) - This book is not an English translation of Optik, but rather a substantially new 
book. Shortly after World War II, a number of scientists suggested that Born update and translate his work into 
English. Since there had been many advances in optics in the intervening years, updating was warranted. In 
1951, Emil Wolf began as Born's private assistant on the book; it was eventually published in 1959 by Robert 
Maxwell's Pergamon Press - the delay being due to the lengthy time needed "to resolve all the financial and 
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T591 

publishing tricks created by Maxwell." 



Moderne Physik (1933) - Based on seven lectures given at the Technischen Hochschule BerlinJ 60 ^ 

• Atomic Physics (Blackie, London, 1935) - Authorized translation of Moderne Physik by John Dougall, with 
updates/ 61 ^ 

The Restless Universe ^ (Blackie and Son Limited, 1935) - A popularized rendition of the workshop of nature. 
Born's nephew, Otto Konigsberger, whose successful career as an architect in Berlin was brought to an end when 
the Nazis took over, was temporarily brought to England to illustrate the bookJ 60 ^ 

Experiment and Theory in Physics (Cambridge University Press, 1943) — The address given King's College, 
Newcastle-on-Tyne, at the request of the Durham Pholosophical Society and the Pure Science Society. An 
expanded version of the lecture appeared in a 1956 Dover Publications edition. ^ 

Natural Philosophy of Cause and Chance (Oxford University Press, 1949) - Based on Born's 1948 Waynflete 
lectures, given at the College of St. Mary Magdalen, Oxford University. A later edition (Dover, 1964) included 
two appendices: "Symbol and Reality" and Born's lecture given at the Nobel laureates 1964 meeting in Landau, 
Germany J 64 ^ 

A General Kinetic Theory of Liquids with H. S. Green (Cambridge University Press, 1949) - The six papers in 
this book were reproduced with permission from the Proceedings of the Royal Society. 
Physics in My Generation: A Selection of Papers (Pergamon, 1956) ^ 
Physik im Wandel meinerZeit (Vieweg, 1957) 
Physik undPolitik (VandenHoeck und Ruprecht, 1960) 

Zur Begriindung der Matrizenmechanik, with Werner Heisenberg and Pascual Jordan (Battenberg, 1962) - 
Published in honor of Max Born's 80th birthday. This edition reprinted the authors' articles on matrix mechanics 
published in Zeitscrift fur Physik , Volumes 26 and 33-35, 1924-1 926. [66] 

My Life and My Views: A Nobel Prize Winner in Physics Writes Provocatively on a Wide Range of Subjects 
(Scribner, 1968) - Part II (pp. 63-206) is a translation of Verantwortung des Naturwissenschaftlers.^ 
Briefwechsel 1916-1955, kommentiert von Max Born with Hedwig Born and Albert Einstein (Nymphenburger, 
1969) 

• The Born-Einstein Letters: Correspondence between Albert Einstein and Max and Hedwig Born from 
1916-1955, with commentaries by Max Born (Macmillan, 1971). t68] 

Mein Leben: Die Erinnerungen des Nobelpreistrdgers (Munich: Nymphenburger, 1975). Born's published 
memoirs. 

• My Life: Recollections of a Nobel Laureate (Scribner, 1978). t69] Translation of Mein Leben. 
Born Nobel Prize Speech [70] - 1954 

Born Nobel Prize Lecture [71] - 1954 

Published papers (as listed on the Smithsonian/NASA Astrophysics Data System (ADS)) 
T731 

Published papers (as listed on HistCite) 
T741 

Published Books (based on the Library of Congress citations) 

T751 

Published Works - Berlin-Brandenburgische Akademie der Wissenschaften Akademiebibliothek 
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Selected journal literature 

While links have been provided in this article to journal publications by Born, a few of his papers are worth 
highlighting here along with citations to translations in English. 

Matrix Mechanics A trilogy of papers launched the matrix mechanics formulation of quantum mechanics: 

• W. Heisenberg, Uber quantentheoretische Umdeutung kinematischer und mechanischer Beziehungen, Zeitschrift 
fur Physik, 33, 879-893, 1925 (received 29 July 1925). [English translation in: B. L. van der Waerden, editor, 
Sources of Quantum Mechanics (Dover Publications, 1968) ISBN 0-486-61881-1 (English title: 
Quantum-Theoretical Re -interpretation of Kinematic and Mechanical Relations).] 

• M. Born and P. Jordan, Zur Quantenmechanik, Zeitschrift fur Physik, 34, 858-888, 1925 (received 27 September 
1925). [English translation in: B. L. van der Waerden, editor, Sources of Quantum Mechanics (Dover 
Publications, 1968) ISBN 0-486-61881-1 (English title: On Quantum Mechanics).] 

• M. Born, W. Heisenberg, and P. Jordan, Zur Quantenmechanik II, Zeitschrift fiir Physik, 35, 557-615, 1926 
(received 16 November 1925). [English translation in: B. L. van der Waerden, editor, Sources of Quantum 
Mechanics (Dover Publications, 1968) ISBN 0-486-61881-1] 

Probability Density The now-standard interpretation of the probability density function for in the Schrodinger 
equation of quantum mechanics was published by Born in the first of these two papers, and it is this for which he 
was awarded the Nobel Prize in Physics in 1954. The second paper is a continuation and extension of the analysis 
provided in the first paper. 

• Max Born (1926). "Zur Quantenmechanik der StoBvorgange" ^ 16 \ Zeitschrift fur Physik 37: 863-867. 
doi:10.1007/BF01397477. Retrieved 2009-12-16. 

• Max Born (1926). "Quantenmechanik der StoBvorgange" [77] . Zeitschrift fur Physik 38: 803-827. 
doi:10.1007/BF01397184. Retrieved 2009-12-18. "English translation in: Gunther Ludwig, editor, Wave 
Mechanics (Pergamon, 1968) ISBN 08-103204-8. Under the title: Quantum Mechanics of Collision Processes". 



Awards and honors 

1934 - Stokes Medal of Cambridge [78] 
1939 - Fellow of the Royal Society [78] 

1945 - MacDougall-Brisbane Medal of the Royal Society of Edinburgh 
1945 - Gunning- Victoria Jubilee Prize of the Royal Society of Edinburgh t78] 

T781 

1948 - Max Planck Medaille der Deutschen Physikalischen Gesellschaft 
1950 - Hughes Medal of the Royal Society of London [78] 

T781 

1953 - Honorary citizen of the town of Gottingen 

1954 - Nobel Prize in Physics The award was for Born's fundamental research in quantum mechanics, especially 

T781 

for his statistical interpretation of the wavefunction. 

• 1954 - Nobel Prize Speech [70] 

• 1954 - Born Nobel Prize Lecture [71] 

T781 

1956 - Hugo Grotius Medal for International Law, Munich 

1959 - Grand Cross of Merit with Star of the Order of Merit of the German Federal Republic [80] 
1982 - Ceremony at the University of Gottingen in the 100th Birth Year of Max Born and James Franck, Institute 
Directors 1921 - 1933. [81] 

T821 

Max-Born Institut fiir Nichtlineare Optik und Kurzzeitspektroskopie im Forschungsverbund Berlin e.V. 
Institute named in his honor. 
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Eugene Wigner 

The native form of this personal name is Wigner J eno. This article uses the Western name order. 



Eugene P. Wigner 


Mi 

Ik* 
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Jersey, 

United States 
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Victor Frederick Weisskopf 
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Abner Shimony 
Edwin Thompson Jaynes 
Frederick Seitz 
Conyers Herring 
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Known for 


Law of conservation of parity 




Wigner D-matrix 




Wigner-Eckart theorem 




Wigner' s friend 




Wigner semicircle distribution 




Wigner' s classification 




Wigner quasi-probability distribution 




Wigner crystal 




Wigner effect 




Wigner-Seitz cell 




Relativistic Breit-Wigner distribution 




Modified Wigner distribution function 




Wigner-d'Espagnat inequality 




Gabor-Wigner transform 




Wigner' s theorem 




Wigner distribution 




Jordan-Wigner transformation 




Newton-Wigner localization 




Wigner-Seitz radius 




6-j symbol 




9-j symbol 
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Eugene Feenberg 




George Cowan 




Robert Serber 




Igal Talmi 


Notable awards 


Enrico Fermi Award (1958) 




Max Planck Medal (1961) 




Nobel Prize in Physics (1963) 




National Medal of Science (1969) 


Signature 




Notes 




He was Paul Dirac's brother-in-law and the uncle of Gabriel Andrew Dirac. 



Eugene Paul "E. P. M Wigner (Hungarian Wigner Jeno Pal; November 17, 1902 - January 1, 1995) was a 
Hungarian American physicist and mathematician. 

He received a share of the Nobel Prize in Physics in 1963 "for his contributions to the theory of the atomic nucleus 
and the elementary particles, particularly through the discovery and application of fundamental symmetry 
principles"; the other half of the award was shared between Maria Goeppert-Mayer and J. Hans D. Jensen. Some 
contemporaries referred to Wigner as the Silent Genius and some even considered him the intellectual equal to 
Albert Einstein, though without his prominence. Wigner is important for having laid the foundation for the theory of 
symmetries in quantum mechanics as well as for his research into the structure of the atomic nucleus, and for his 
several mathematical theorems. It was Eugene Wigner who first identified Xe-135 "poisoning" in nuclear reactors, 
and for this reason it is sometimes referred to as Wigner poisoning.^ 
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Early life 




Werner Heisenberg + Eugene Wigner (1928) 



Wigner was born in Budapest, Austria-Hungary, into a middle class 
Jewish family. At the age of 11, Wigner contracted what his parents 
believed to be tuberculosis. They sent him to live for six weeks in a 
sanitarium in the Austrian mountains. During this period, Wigner 
developed an interest in mathematical problems. From 1915 through 
1919, together with John von Neumann, Wigner studied at the Fasori 
Evangelikus Gimnazium, where they both benefited from the 
instruction of the noted mathematics teacher Laszlo Ratz. In 1919, to 
escape the Bela Kun communist regime, the Wigner family briefly 
moved to Austria, returning to Hungary after Kun's downfall. Partly as 
a reaction to the prominence of Jews in the Kun regime, the family 



converted to Lutheranism. 

In 1921, Wigner studied chemical engineering at the Technische Hochschule in Berlin (today the Technische 
Universitat Berlin). He also attended the Wednesday afternoon colloquia of the German Physical Society. These 
colloquia featured such luminaries as Max Planck, Max von Laue, Rudolf Ladenburg, Werner Heisenberg, Walther 
Nernst, Wolfgang Pauli, and Albert Einstein. Wigner also met the physicist Leo Szilard, who at once became 
Wigner's closest friend. A third experience in Berlin was formative. Wigner worked at the Kaiser Wilhelm Institute 
for Physical Chemistry and Elektrochemistry (now the Fritz Haber Institute), and there he met Michael Polanyi, who 
became, after Laszlo Ratz, Wigner's greatest teacher. 

Middle years 

In the late 1920s, Wigner deeply explored the field of quantum mechanics. A period at Gottingen as an assistant to 
the great mathematician David Hilbert proved a disappointment, as Hilbert was no longer active in his works. 
Wigner nonetheless studied independently. He laid the foundation for the theory of symmetries in quantum 
mechanics and in 1927 introduced what is now known as the Wigner D-matrix. It is safe to state that he and 
Hermann Weyl carry the whole responsibility for the introduction of group theory into quantum mechanics (they 
spread the "Gruppenpest"). See Wigner's 1931 monograph for a survey of his work on group theory. In the late 
1930s, he extended his research into atomic nuclei. He developed an important general theory of nuclear reactions 
(see for instance the Wigner-Eckart theorem). By 1929, his papers were drawing notice in the world of physics. In 
1930, Princeton University recruited Wigner, which was very timely, since the Nazis soon rose to power in 
Germany. At Princeton in 1934, Wigner introduced his sister Manci to the physicist Paul Dirac, whom she married. 

In 1936, Princeton University did not rehire Wigner, hence he searched for new employment. He found this at the 
University of Wisconsin. There he met his first wife, Amelia Frank, who was a physics student there. However she 
died unexpectedly in 1937, naturally leaving Wigner distraught. 

On January 8, 1937, Wigner became a naturalized citizen of the United States. Princeton University soon invited 
Wigner back into its employment, and he rejoined its faculty in Fall 1938. Although he was a professed political 
amateur, in 1939 and 1940 he played a major role in prompting the U.S. Government to establish the Manhattan 
Project, which developed the first atomic bomb by 1945. However, by his personal beliefs, Wigner was at heart a 
pacifist. Wigner was present at a converted squash courts at the University of Chicago's abandoned Stagg Field on 
December 2, 1942, when the world's first atomic reactor, Chicago Pile One (CP-1) achieved a nuclear chain reaction 
(a critical reaction). ^ He later contributed to civil defense in the U.S. In 1946, Wigner accepted a position as the 
Director of Research and Development at the Clinton Laboratory (now the Oak Ridge National Laboratory) in Oak 
Ridge, Tennessee. When his duties there did not work out especially well, Wigner returned to Princeton University. 
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In 1941, Wigner married his second wife, Mary Annette Wheeler, a professor of physics at Vassar College, who had 
completed her Ph.D. at Yale University in 1932. They remained married until her death in 1977, and they were the 
parents of two children. 



Last years 




In 1960, Wigner published a now classic article on the philosophy of 
mathematics and of physics, which has become his best-known work 
outside of technical mathematics and physics, "The Unreasonable 
Effectiveness of Mathematics in the Natural Sciences". He argued that 
biology and cognition could be the origin of physical concepts, as we 
humans perceive them, and that the happy coincidence that 
mathematics and physics were so well matched, seemed to be 
"unreasonable" and hard to explain. His reasoning was resisted by the 
Harvard mathematician Andrew M. Gleason. 



Patricia Eileen (left) and Eugene Paul Wigner at 

their home in Princeton. m 1963, Wigner was awarded the Nobel Prize in Physics. Wigner 

professed to never have considered the possibility that this might 

occur, and he added: "I never expected to get my name in the newspapers without doing something wicked." Wigner 

later won the Enrico Fermi award, and the National Medal of Science. In 1992, at the age of 90, Wigner published a 

memoir, The Recollections of Eugene P. Wigner with Andrew Szanton. Wigner died three years later in Princeton, 

New Jersey. One of his significant students was Abner Shimony. Wigner's third wife was Eileen Clare-Patton 

Hamilton Wigner ("Pat"), the widow of another physicist, Donald Ross Hamilton, who had died in 1972. (He had 

been the Dean of the Graduate School at Princeton University.) 



Near the end of his life, Wigner's thoughts turned more philosophical. 
In his memoirs, Wigner said: "The full meaning of life, the collective 
meaning of all human desires, is fundamentally a mystery beyond our 
grasp. As a young man, I chafed at this state of affairs. But by now I 
have made peace with it. I even feel a certain honor to be associated 
with such a mystery." He became interested in the Vedanta philosophy 
of Hinduism, particularly its ideas of the universe as an all pervading 
consciousness. [6] In his collection of essays Symmetries and 
Reflections — Scientific Essays, he commented "It was not possible to 




formulate the laws (of quantum theory) in a fully consistent way Feza Gursey (right) with Eugene Wigner, photo 

without reference to consciousness." by Y. S. Kim [5] (1988) (permission of Prof. Kim 

to release it to public domain). 

Wigner also conceived the Wigner's friend thought experiment in 

physics, which is an extension of the Schrodinger's cat thought experiment. The Wigner's friend experiment asks the 
question: "At what stage does a 'measurement' take place?" Wigner designed the experiment to highlight how he 
believed that consciousness is necessary to the quantum-mechanical measurement processes. 
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Honors 

• Franklin Medal, 1950 

• Atoms for Peace Award, 1959 

• Eugene P. Wigner Reactor Physicist Award at the American Nuclear Society. 

rsi 

• Enrico Fermi Award . 

rm 

• Wigner Fellowship Program at Oak Ridge National Laboratory (ORNL). 

• Walli, Ron. "Auditorium at ORNL Renamed in Honor of Eugene P. Wigner" ^ ORNL Press Release, (Jan. 11, 
1996). 



Publications 

• 1939, "On unitary representations of the inhomogeneous Lorentz group, ^^" Annals Math. 40: 149-204. 

• (with Creutz, E. C. & R. R. Wilson) "Absorption of Thermal Neutrons in Uranium, Princeton University, 
United States Department of Energy (through predecessor agency the Atomic Energy Commission), (Sept. 26, 
1941). 

• "Radioactivity of the Cooling Water, J " Metallurgical Laboratory of the University of Chicago, United States 
Department of Energy (through predecessor agency the Atomic Energy Commission), (March 1, 1943). 

• "Solutions of Boltzmann v s Equation for Mono-energetic Neutrons in an Infinite Homogeneous Medium, ^" 
Metallurgical Laboratory of the University of Chicago, United States Department of Energy (through predecessor 
agency the Atomic Energy Commission), (Nov. 30, 1943). 

• (with Weinberg, A. M. & J. Stephenson) "Recalculation of the Critical Size and Multiplication Constant of a 
Homogeneous UO{sub 2} - D{sub 2}0 Mixtures, tl5] " Metallurgical Laboratory of the University of Chicago, 
(Feb. 11, 1944). 

• (with F.L. Friedman) "On the Boundary Condition Between Two Multiplying Media, ^" Metallurgical 
Laboratory of the University of Chicago, (April 19, 1944). 

• (with J. E. Wilkins, Jr.) "Effect of the Temperature of the Moderator on the Velocity Distribution of Neutrons 
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with Numerical Calculations for H as Moderator, Oak Ridge National Laboratory (ORNL), United States 
Department of Energy (through predecessor agency the Atomic Energy Commission), (Sept. 14, 1944). 
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• "On the Variation of Eta with Energy in the 100-1000 ev Region, Brookhaven National Laboratory, United 

States Department of Energy (through predecessor agency the Atomic Energy Commission), (Nov. 1, 1949). 
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• "The Magnitude of the Eta Effect, Du Pont de Nemours (E.I.) & Co., United States Department of Energy 
(through predecessor agency the Atomic Energy Commission), (April 25, 1951). 

• 1958 (with Alvin M. Weinberg). Physical Theory of Neutron Chain Reactors (University of Chicago Press. ISBN 
0-226-88517-8 

• 1959. Group Theory and its Application to the Quantum Mechanics of Atomic Spectra. New Yor: Academic 
Press. Translation by J. J. Griffin of 1931, Gruppentheorie und ihre Anwendungen auf die Quantenmechanik der 
Atomspektren, Vieweg Verlag, Braunschweig. 

• 1960, "The Unreasonable Effectiveness of Mathematics in the Natural Sciences, t20] " Communications on Pure 
and Applied Mathematics 13(1): 1—14. 

• 1970. Symmetries and Reflections: Scientific Essays. MIT Press. ISBN 0-262-73021-9 

• 1992 (as told to Andrew Szanton). The Recollections of Eugene P. Wigner. Plenum. ISBN 0-306-44326-0 

• 1997 (with G. G. Emch; Jagdish Mehra and Arthur S. Wightman, eds.). Philosophical Reflections and Syntheses. 
Springer. ISBN 3-540-63372-3 
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External links 

• Annotated bibliography for Eugene Wigner from the Alsos Digital Library for Nuclear Issues (http://alsos.wlu. 
edu/qsearch.aspx?browse=people/Wigner,+Eugene) 

• Biography and Bibliographic Resources (http://www.osti.gov/accomplishments/wigner.html), from the Office 
of Scientific and Technical Information, United States Department of Energy 

• Oral history interview with Eugene P. Wigner (http://www.cbi. umn.edu/oh/display.phtml?id=77) Charles 
Babbage Institute, University of Minnesota, Minneapolis - Wigner talks about his association with John von 
Neumann during their school years in Hungary, their graduate studies in Berlin, and their appointments to 
Princeton in 1930. Wigner discusses von Neumann's contributions to the theory of quantum mechanics, Wigner's 
own work in this area, and von Neumann's interest in the application of theory to the atomic bomb project. 

• Eugene Wigner Biography (http://www.nobel-winners.com/Physics/eugene_paul_wigner.html) 

• Nobel Prize Biography (http://nobelprize.org/nobel_prizes/physics/laureates/1963/wigner-bio.html) 

• National Academy of Sciences biography (http://www.nap.edu/readingroom/books/biomems/ewigner.html) 

• O'Connor, John J.; Robertson, Edmund F., "Eugene Wigner" (http://www-history.mcs.st-andrews.ac.uk/ 
Biographies/Wigner.html), MacTutor History of Mathematics archive, University of St Andrews. 

• his contributions to the theory of the atomic nucleus and the elementary particles, particularly through the 
discovery and application of fundamental symmetry principles (http://geratorp.bravehost.com/dmx/ 
wigner-bio.html) 

• An interview with Wigner about his experience at Princeton (http://www.princeton.edu/-mudd/finding_aids/ 
mathoral/pmc44 . htm) 

• Oral history interview transcript with Eugene Wigner 21 November 1963, American Institute of Physics, Niels 
Bohr Library & Archives (http://www.aip.org/history/ohilist/4963_l.html) 

• Oral history interview transcript with Eugene Wigner 24 January 1981, American Institute of Physics, Niels Bohr 
Library & Archives (http://www.aip.org/history/ohilist/4965.html) 
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Alexandra Proca 



Alexandra Proca 


Born 


October 16, 1897Bucharest, Romania 


Died 


December 13, 1955 (aged 58)Paris, France 


Citizenship 


France 


Nationality 


Romania 


Fields 


Physicist (theoretical) 


Alma mater 


Paris-Sorbonne University in France 


Doctoral advisor 


Louis de Broglie 


Known for 


Proca' s equations 


Notable awards 


Honorary Member of the Romanian Academy of Arts and Sciences, elected post mortem in 1990. 



Alexandra Proca (October 16, 1897, Bucharest - December 13, 1955, Paris) was a Romanian physicist. He 
developed the meson theory of nuclear forces and the mathematical physics equations that bear his name (Proca's 
equations). He became a French citizen in 1931. 

Education 
High-school and college 

In Romania, he was one of the eminent students of the school "Gheorghe Lazar" and the Polytechnic School in 
Bucharest. With a very strong interest in theoretical physics, he went to Paris where he graduated in Science from the 
Paris-Sorbonne University, receiving from the hand of Marie Curie his diploma of the Bachelor of Science degree. 
Then, he was employed as a researcher/physicist at the Radium Institute in Paris in 1925. 

Ph.D. studies 

He carried out Ph.D. studies in theoretical physics under the supervision of Nobel laureate Louis de Broglie. He 
defended successfully his Ph.D. thesis entitled "On the relativistic theory of Dirac's electron" in front of an 
examination committee chaired by the Nobel laureate Jean Perrin. 

Scientific achievements 

He also studied and worked with Nobel laureates Niels Bohr and Marie Curie, ^ . Alexandru Proca became to be 
known as one of the most influential Romanian theoretical physicists of the last century. 1 having developed the 
meson theory of nuclear forces ahead of the first reports of Nobel laureate Hideki Yukawa. Proca's equations for the 
vectorial mesonic field were employed by Yukawa who subsequently received the Nobel Prize for an explanation of 
the nuclear forces by using this field. 
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Publications at the Library of Congress 

• Library of Congress 

Notes 

[1] Romanian review, Volume 30 (http://books. google. com/books ?id=K6viAAAAMAAJ&cd=l), Distributed by Europolis Pub., 1976, 
p. 105, 

[2] Brown, Laurie M.; Rechenberg, Helmut (1996), The origin of the concept of nuclear forces (http://books.google.com/ 

books ?id=IJPTgDTOmgMC), CRC Press, p. 185, ISBN 9780750303736, 
[3] http://catalog.loc.gov/cgi-bin/Pwebreconx 

CNT=100&PID=owd-ufzQyH-sy8CmbjVKECI_Mxe&SEQ=20100712075927&SID=4 
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Hideki Yukawa M)\ \ 9 


m 




Sffi 




Born 


23 January 1907Tokyo, Japan 


Died 


8 September 1981 (aged 74)Kyoto, Japan 


Nationality 


Japan 


Fields 


Theoretical Physics 


Institutions 


Osaka Imperial University 
Kyoto Imperial University 
Imperial University of Tokyo 
Institute for Advanced Study 
Columbia University 


Alma mater 


Kyoto Imperial University 


Influences 


Enrico Fermi 


Notable awards 


Nobel Prize in Physics (1949) 



Hideki Yukawa FRSE ftf Yukawa Hideki, 23 January 1907 - 8 September 1981) ne Ogawa (/J\ll|), was a 

Japanese theoretical physicist and the first Japanese Nobel laureate. 

Biography 

Yukawa was born in Tokyo, Japan. In 1929, after receiving his degree from Kyoto Imperial University, he stayed on 
as a lecturer for four years. After graduation, he was interested in theoretical physics, particularly in the theory of 
elementary particles. In 1932, he married Sumi (JX ^ ); they had two sons, Harumi and Takaaki. In 1933 he became 
an assistant professor at Osaka University, at age 26. 

In 1935 he published his theory of mesons, which explained the interaction between protons and neutrons, and was a 
major influence on research into elementary particles. In 1940 he became a professor in Kyoto University. In 1940 
he won the Imperial Prize of the Japan Academy, in 1943 the Decoration of Cultural Merit from the Japanese 
government. In 1949 he became a professor at Columbia University, the same year he received the Nobel Prize in 
Physics, after the discovery by Cecil Frank Powell, Giuseppe Occhialini and Cesar Lattes of Yukawa's predicted 
pion in 1947. Yukawa also worked on the theory of K-capture, in which a low energy electron is absorbed by the 
nucleus, after its initial prediction by G. C. WickJ 1 ^ 

Yukawa became the first chairman of Yukawa Institute for Theoretical Physics in 1953. He received a Doctorate, 
honoris causa, from the University of Paris and honorary memberships in the Royal Society of Edinburgh, the Indian 
Academy of Sciences, the International Academy of Philosophy and Sciences, and the Pontificia Academia 
Scientiarum. 
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He was an editor of Progress of Theoretical Physics, and published the papers Introduction to Quantum 
Mechanics (1946) and Introduction to the Theory of Elementary Particles (1948). 

In 1955, he joined ten other leading scientists and intellectuals in signing the Russell-Einstein Manifesto, calling for 
nuclear disarmament. 

Solo violinist Diana Yukawa ( 4 7 i~ J 1 1 ) is a relative of Hideki Yukawa. 

Bibliography 

• Profiles of Japanese science and scientists, 1970 1 Supervisory editor: Hideki Yukawa (1970) 

• Creativity and intuition : a physicist looks at East and West I by Hideki Yukawa ; translated by John B ester 
(1973) 

• Scientific works (1979) 

• Tabibito = The traveler I Hideki Yukawa ; translated by L. Brown & R. Yoshida (1982) 
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External links 

• Hideki Yukawa - Biography (http://www.nobel.se/physics/laureates/1949/yukawa-bio.html) 

• About Hideki Yukawa (http://www.nobel-winners.com/Physics/hideki_yukawa.html) 

• his prediction of the existence of mesons on the basis of theoretical work on nuclear forces. (http://holiker. 
narod.ru/four/yukawa-press.html) 
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Neville Mott 



Born 
Died 



Nationality 
Fields 



Institutions 



Alma mater 



Nevill Francis Mott 




Notable awards 



30 September 1905Leeds, England 

8 August 1996 (aged 90)Milton Keynes, Buckinghamshire, England 



United Kingdom 
Physics 



University of Manchester 

Gonville and Caius College, Cambridge 

University of Bristol 

St John's College, Cambridge 



Nobel Prize in Physics (1977) 



Sir Nevill Francis Mott, CH, FRS (30 September 1905 - 8 August 1996) was an English physicist. He won the 
Nobel Prize for Physics in 1977 for his work on the electronic structure of magnetic and disordered systems, 
especially amorphous semiconductors. The award was shared with Philip W. Anderson and J. H. Van Vleck, who 
had pursued independent research. 



Biography 

Early years 

Mott was born in Leeds to Lilian Mary Reynolds and Charles Francis Mott. and grew up first in the village of 
Giggleswick, in the West Riding of Yorkshire, where his father was Senior Science Master at the local school. It was 
a generally secular childhood. The family moved (due to his father's jobs) first to Staffordshire, then to Chester and 
finally Liverpool, where his father had been appointed Director of Education. Mott was at first educated at home by 
his mother, who was a Cambridge Mathematics Tripos graduate. His parents had actually met in the Cavendish 
Laboratory, when both engaged in Physics research. At ten years of age he began formal education at Clifton 
College in Bristol, then at St. John's College, Cambridge, where he read the Mathematics Tripos. 
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Career 

Mott was appointed to a lecturership at Manchester University in 1929. He returned to Cambridge in 1930 as a 
Fellow and lecturer of Gonville and Caius College and in 1933 moved to Bristol University as Melville Wills 
Professor in Theoretical Physics. 

In 1948 he became Henry Overton Wills Professor of Physics and Director of the Henry Herbert Wills Physical 
Laboratory at Bristol. In 1954 he was appointed Cavendish Professor of Physics at Cambridge, a post he held until 
1971. Additionally he served as Master of Gonville and Caius College, 1959-1966. 

Mott's accomplishments include explaining theoretically the effect of light on a photographic emulsion (see latent 
image) and outlining the transition of substances from metallic to nonmetallic states (Mott transition). The term Mott 
insulator is also named for him. 

Mott was elected a Fellow of the Royal Society in 1936. Mott served as president of the Physical Society in 1957. In 
the early 1960s he was chairman of the British Pugwash group. He was knighted in 1962. [1] He continued to work 
until he was about ninety. He was made a Companion of Honour in 1995. 

Personal life 

Mott was married to Ruth Eleanor Horder, and had two daughters, Elizabeth and Alice. He died in Milton Keynes in 
B uckinghamshire . 

Bibliography 

• N. F. Mott, Metal-Insulator Transitions, second edition (Taylor & Francis, London, 1990). ISBN 0850667836, 
ISBN 978-0850667837 

• N. F. Mott, A Life in Science, (Taylor & Francis, London, 1986). ISBN 0850663334, ISBN 978-0850663334 
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• Delightful BBC video of 1985 interview with Nevill Mott [3] (accessed Oct 8, 2010) 

• Nobel lecture [4] (PDF) 

• Sir Nevill Francis Mott [5] 

• Mott's memories ^ University of Bristol (accessed Jan 2006) 

• National Cataloguing Unit for the Archives of Contemporary Scientists Bath University 
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Paul W. Anderson 



Paul W. S. Anderson 




Anderson at WonderCon 2010 



Born 4 March 1965Newcastle-upon-Tyne, England 

Occupation Film director, producer and screenwriter 

Spouse Milla Jovovich (2009-present) 

Children Ever Gabo Anderson (3 November 2007) 

Paul William Scott Anderson (born 4 March 1965), also known as Paul W.S. Anderson or Paul Anderson, is an 

English film director who regularly works in science fiction movies and video game adaptations. 

Life and career 

Anderson was born in Newcastle upon Tyne, England. Educated at Newcastle's Royal Grammar School, Anderson 
went on to graduate from the University of Warwick as the youngest student to achieve a B A in Film & Literature. 
He made his debut as the writer-director of Shopping, which starred Sean Pertwee, Jude Law and Sadie Frost as 
thieves who smashed cars into storefronts. When released in the United Kingdom it was banned in some cinemas, 
and only gained a release in the United States as an edited, direct to video release. 

After this, he directed the successful 1995 video game adaptation Mortal Kombat. While prior video game movies, 
like Street Fighter and Super Mario Bros., had been all-out disasters, Mortal Kombat was well received by fans, and 
some critics. He declined to direct the sequel, Mortal Kombat: Annihilation, which was not well received by critics 
or fans. Anderson was asked to direct a third movie, Mortal Kombat: Devastation, but declined again. 

The success of Mortal Kombat gave Anderson free rein to choose his next project, Soldier, written by Blade Runner 
screenwriter David Webb Peoples. Intended as a "sidequel" to Blade Runner, the movie was set in the same universe 
(but not the same planet), and contained numerous references to the earlier film. Kurt Russell was attached to star, 
but was unavailable at the time, which delayed the production. In the meantime, Anderson made the science fiction 
horror film Event Horizon. It was poorly received at the box office, and Anderson blamed the failure on 
studio-enforced cuts. While not a box-office success, the film gained a small cult following. Soldier was eventually 
completed and released in 1998, but was a disaster both commercially and critically. 

After the poor performance of both Event Horizon and Soldier, Anderson was forced to think smaller. His planned 
remake of the cult film Death Race 2000 was put on hold, and he set about writing and directed a TV movie, The 
Sight, in 2000. It was a minor success, and Anderson returned to cinema screens in 2002 when he wrote and directed 
an adaptation of the survival horror series Resident Evil. At that point he began to credit himself as "Paul W. S. 
Anderson", to avoid confusion with the American director Paul Thomas Anderson. 
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Produced on a moderate budget in comparison to his earlier movies, Resident Evil was a commercial success in 
cinemas and on DVD, prompting Anderson to write (but not direct) the sequels, Resident Evil: Apocalypse and 
Resident Evil: Extinction both were commercially successful but failed critically. 

Anderson's next project was the much-anticipated Alien vs. Predator, a concept hinted at in Predator 2 and later 
popularized by a series of Dark Horse Comics. A movie version had been stuck in development hell for several years 
despite the franchise crossing into every other form of media, from books to comics to video games. The film was 
finally released in August 2004, and proved to be Anderson's most successful film to date, grossing $172,544,654^ 
internationally on a budget of $60 million, despite his success it received mostly negative reviews. After completing 
Alien vs. Predator Anderson rebooted his Death Race 2000 remake and finally got it released as Death Race in 2008. 

Anderson will next direct an adaptation of The Three Musketeers, who will be played by Logan Lerman, Matthew 

T21 

Macfadyen, Ray Stevenson and Luke Evans. 
Personal life 

In April 2007, People Magazine announced that he and actress Milla Jovovich were expecting a baby girl in 
November 2007. The two met when Anderson directed her in the first Resident Evil. They were engaged in March 
2003, and were married on August 22, 2009. ^ ^ Jovovich gave birth to their first child, a daughter, Ever Gabo 
Anderson, on 3 November 2007 at Cedars-Sinai Medical Center in Los Angeles, California, one day before her due 
date of November 4.^ 



Criticism 

Screenwriter Peter Briggs, who had penned the very first Alien vs. Predator screenplay, disputed some of Anderson's 
other comments in an online interview, saying Anderson's claim that Briggs' original screenplay was "locked down" 
was incorrect, and that many elements of Anderson's screenplay were suspiciously similar J 6 ^ 



Filmography 
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Film 
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Director 


Writer 


Producer 


1994 


Shopping 


Yes 


Yes 




1995 


Mortal Kombat 


Yes 






1997 


Event Horizon 


Yes 






1998 


Soldier 


Yes 






2000 


The Sight 


Yes 


Yes 


Yes 


2002 


Resident Evil 


Yes 


Yes 


Yes 


2004 


Alien vs. Predator 


Yes 


Yes 




Resident Evil: Apocalypse 




Yes 


Yes 


2005 


The Dark 






Yes 


2007 


DO A: Dead or Alive 






Yes 


Resident Evil: Extinction 




Yes 


Yes 


2008 


Death Race 


Yes 


Yes 


Yes 


2009 


Pandorum 






Yes 


2010 


Resident Evil: Afterlife 


Yes 


Yes 


Yes 


2011 


The Three Musketeers 


Yes 




Yes 
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George Karreman 


Born 


November 4, 1921 Rotterdam, Netherlands 


Died 


January 23, 1997 (aged 75 Philadelphia, United States 


Residence 


Netherlands, United States 


Nationality 


US and Netherlands 


Fields 


Physics, mathematical biology, Quantum biology 


Institutions 


University of Pennsylvania 


Alma mater 


Leiden University, University of Chicago 


Doctoral advisor 


Nicolas Rashevsky 


Other academic advisors 


Hendrik Anthony Kramers 


Known for 


Quantum biology 



George Karreman (b. 4 November 1921 in Rotterdam, Netherlands - 23 January 1997 in Philadelphia, USA) was a 
Dutch-born US physicist, mathematical biophysicist and mathematical/theoretical biologist. He was the first 
President of the Society for Mathematical Biology (SMB) J 1] 

Biography 

Karreman's father was Chief Engineer for the Dutch Merchant Marine. George Karreman studied physics and 

mathematics at Leiden University. In August 1948 Karreman emigrated to Chicago, USA, where he contacted 

Nicolas Rashevsky at the University of Chicago. He became one of Rashevsky' s best PhD students in Mathematical 
T21 

Biophysics. In 1950 Karreman underwent heart surgery at the University of Chicago and from then on had to carry 
an implanted pacemaker. He married Anneke Halbertsma in 1953, and they moved to Cape Cod, Massachusetts, 
where their daughter, Grace, was born in 1954. In slower succession, his first son, Frank Karreman was born in 
1958, and then in 1962 his second son, Hubert- Jan. Later, his three children received advanced degrees from the 
University of Pennsylvania in several fields. 

Karreman was an exceptionally devoted educator, who was always supportive of his research associates, family, and 
friends; he was a generous man, obviously having not forgotten Rashevsky's help in his early life at the University of 
Chicago. He had a wide range of interests in his readings, a keen interest in the fine arts — such as paintings, was 
an advanced chess player, and a most devoted husband and father.^ Between 1987 and 1997, he frequently travelled 
to the Pacific Northwest where his son and daughter had their homes and children. 

Academic career 

Karreman earned his B.S. in Physics and Mathematics in 1939. He completed in 1941 his M.S. in Theoretical 

Physics under the supervision of Hendrik Anthony Kramers at Leiden University. For the remainder of World War II 

he survived by tutoring students in physics and mathematics. He was awarded a University of Chicago Fellowship 

that supported him to complete a Ph.D. in Mathematical Biology in 1951 under the supervision of the founder of 

Mathematical Biophysics and Mathematical Biology, Nicolas Rashevsky Karreman was then selected as a 

consultant to Albert Szent-Gyorgyi at the Institute for Muscle Research, Marine Biological Laboratory, Woods 

Hole.^ . To follow his interest in mathematics applied to biology, physiology, and medicine he went to the 

University of Pennsylvania in Philadelphia in 1957, to take up the position of Senior Medical Research Scientist at 

T71 

the Eastern Research Center. 1 He was appointed associate professor of physiology at the School of Medicine in the 
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University of Pennsylvania, where he also worked at the Bockus Research Institute at the Graduate Hospital. He was 
promoted to Full Professor of Physiology at the same university in 1970, where he held this position until 1983, 
when he became the first Professor Emeritus of Mathematical Biology. His main research interests were many, but 
all were in related fields that included: mathematical biology and mathematical biophysics, membrane biophysics, 
photosynthetic mechanisms, quantum biochemistry and quantum biophysics, ^ biological energy transfer, quantum 
biology/ 10 ^ physiological irritability, mathematical and systems analysis of cardiovascular and other biosystems, 
cooperativity, threshold phenomena in biomembranes, adsorption mechanisms at membrane surfaces and ion binding 
to biomembranes. 

After Rashevsky's passing away in 1972, Karreman was a co-founder of the Society for Mathematical Biology 
(SMB) in 1974 — together with H. Landahl and A. Bartholomay, and in 1975 he became its first president. Karreman 
was also a member of several prestigious scientific societies, including the American Physiological Society, the New 
York Academy of Sciences, the Franklin Institute, the Society for Supramolecular Biology, Sigma Xi, the 
Physiological Society of Philadelphia, and the Society for Vascular System Dynamics. 



Honours 

• 1974 - First President of the The Society for Mathematical Biology. 



See also 

• Society for Mathematical Biology 

• Mathematical and theoretical biology 

• Quantum biology 

• Nicolas Rashevsky 

• Herbert Daniel Landahl 

• Hendrik Anthony Kramers 
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Alberte Pullman 



Alberte Pullman (nee Bucher) was born in France in 1920. She is a theoretical and quantum chemist. She studied at 
the Sorbonne starting in 1938. During her studies she worked on calculations at Centre National de la Recherche 
Scientifique (CNRS). From 1943 she worked with Raymond Daudel. She completed her doctorate in 1946. On his 
return from war service in 1946, she married Bernard Pullman. She and her husband worked together until his death 
in 1996. Together they wrote several books including Quantum Biochemistry, Interscience Publishers, 1963. Their 
work in the 1950s and 1960s was the beginning of the new field of Quantum Biochemistry. They pioneered the 
application of quantum chemistry to predicting the carcinogenic properties of aromatic hydrocarbons. 

She is a member of the International Academy of Quantum Molecular Science and a member and former President 
of The International Society of Quantum Biology and Pharmacology.^ 
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Notes 




He was the father of Carl Feynman and Michelle Feynman. He was the brother of Joan Feynman. 



Richard Phillips Feynman (pronounced /'fainm9n/, May 11, 1918 - February 15, 1988) was an American physicist 
known for his work in the path integral formulation of quantum mechanics, the theory of quantum electrodynamics 
and the physics of the superfluidity of supercooled liquid helium, as well as in particle physics (he proposed the 
parton model). For his contributions to the development of quantum electrodynamics, Feynman, jointly with Julian 
Schwinger and Sin-Itiro Tomonaga, received the Nobel Prize in Physics in 1965. He developed a widely used 
pictorial representation scheme for the mathematical expressions governing the behavior of subatomic particles, 
which later became known as Feynman diagrams. During his lifetime, Feynman became one of the best-known 
scientists in the world. 

He assisted in the development of the atomic bomb and was a member of the panel that investigated the Space 
Shuttle Challenger disaster. In addition to his work in theoretical physics, Feynman has been credited with 
pioneering the field of quantum computing, and introducing the concept of nanotechnology. He held the Richard 
Chace Tolman professorship in theoretical physics at the California Institute of Technology. 

Feynman was a keen popularizer of physics through both books and lectures, notably a 1959 talk on top-down 
nanotechnology called There's Plenty of Room at the Bottom and The Feynman Lectures on Physics. Feynman also 
became known through his semi-autobiographical books {Surely You're Joking, Mr. Feynman! and What Do You 
Care What Other People Think?) and books written about him, such as Tuva or Bust! 

Feynman also had a deep interest in biology, and was a friend of the geneticist and microbiologist Esther Lederberg, 

Ml 

who developed replica plating and discovered bacteriophage lambda. They had several mutual physicist friends 
who, after beginning their careers in nuclear research, moved for moral reasons into genetics, among them Leo 
Szilard, Guido Pontecorvo, and Aaron Novick. 

Biography 
Early life 

Richard Phillips Feynman was born on May 11, 1918,^ in Far Rockaway, Queens, New York.^ His family 
originated from Russia and Poland; both of his parents were Jewish, but they were not devout. In fact, by his early 

T81 

youth, Feynman described himself as an "avowed atheist". Feynman (in common with the famous physicists 
Edward Teller and Albert Einstein) was a late talker; by his third birthday he had yet to utter a single word. The 
young Feynman was heavily influenced by his father, Melville, who encouraged him to ask questions to challenge 
orthodox thinking. From his mother, Lucille, he gained the sense of humor that he had throughout his life. As a child, 
he delighted in repairing radios and had a talent for engineering. His younger sister Joan also became a professional 
physicist [9][10] 



Richard Feynman 



675 



Education 

In high school, his IQ was determined to be 125 (on an unspecified test, making this statement dubious): high, but 

"merely respectable" according to biographer GleickJ 11 ^ Feynman later scoffed at psychometric testing. By 15, he 

had learned differential and integral calculus. Before entering college, he was experimenting with and re-creating 

mathematical topics, such as the half-derivative, utilizing his own notation. In high school, he was developing the 

ri2i 

mathematical intuition behind his Taylor series of mathematical operators. 

His habit of direct characterization sometimes rattled more conventional thinkers; for example, one of his questions 
when learning feline anatomy was "Do you have a map of the cat?" (referring to an anatomical chart). 

Feynman attended Far Rockaway High School, a school that also produced fellow laureates Burton Richter and 
ri3i 

Baruch Samuel Blumberg. 1 A member of the Arista Honor Society, in his last year in high school, Feynman won 
the New York University Math Championship; the large difference between his score and those of his closest 
competitors shocked the judges J 14 ^ 

He applied to Columbia University, but was not accepted, because of the "Jewish quota" (a discriminatory practice 
of limiting the number of places available to students of Jewish background) J- ^ Instead he attended the 
Massachusetts Institute of Technology, where he received a bachelor's degree in 1939, and in the same year was 
named a Putnam Fellow. While there, Feynman took every physics course offered, including a graduate course on 
theoretical physics while only in his second year. 

He obtained a perfect score on the graduate school entrance exams to Princeton University in mathematics and 
physics — an unprecedented feat — but did rather poorly on the history and English portions J- Attendees at 
Feynman's first seminar included Albert Einstein, Wolfgang Pauli, and John von Neumann. He received a Ph.D. 
from Princeton in 1942; his thesis advisor was John Archibald Wheeler. Feynman's thesis applied the principle of 
stationary action to problems of quantum mechanics, laying the groundwork for the "path integral" approach and 
Feynman diagrams, and was entitled "The Principle of Least Action in Quantum Mechanics". 

This was Richard Feynman nearing the crest of his powers. At twenty-three ... there was no physicist on earth 
who could match his exuberant command over the native materials of theoretical science. It was not just a 
facility at mathematics (though it had become clear ... that the mathematical machinery emerging from the 
Wheeler-Feynman collaboration was beyond Wheeler's own ability). Feynman seemed to possess a frightening 
ease with the substance behind the equations, like Albert Einstein at the same age, like the Soviet physicist 
Lev Landau — but few others. 

— James Gleick, Genius: The Life and Science of Richard Feynman 

The Manhattan Project 

At Princeton, the physicist Robert R. Wilson encouraged Feynman to 
participate in the Manhattan Project — the wartime U.S. Army project 
at Los Alamos developing the atomic bomb. Feynman said he was 
persuaded to join this effort to build it before Nazi Germany developed 
their own bomb. 

He was assigned to Hans Bethe's theoretical division, and impressed 

Bethe enough to be made a group leader. He and Bethe developed the 

ri7i 

Bethe-Feynman formula (which might still be classified ) for 
calculating the yield of a fission bomb, which built upon previous work 
by Robert Serber. 




Feynman (center) with Robert Oppenheimer 
(right) relaxing at a Los Alamos social function 
during the Manhattan Project. 
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He immersed himself in work on the project, and was present at the Trinity bomb test. Feynman claimed to be the 
only person to see the explosion without the very dark glasses or welder's lenses provided, reasoning that it was safe 
to look through a truck windshield, as it would screen out the harmful ultraviolet radiation. 

As a junior physicist, he was not central to the project. The greater part of his work was administering the 
computation group of human computers in the Theoretical division (one of his students there, John G. Kemeny, later 
went on to co-write the computer language BASIC). Later, with Nicholas Metropolis, he assisted in establishing the 
system for using IBM punched cards for computation. Feynman succeeded in solving one of the equations for the 
project that were posted on the blackboards. However, they did not "do the physics right" and Feynman's solution 
was not used. 

Feynman's other work at Los Alamos included calculating neutron equations for the Los Alamos "Water Boiler", a 
small nuclear reactor, to measure how close an assembly of fissile material was to criticality. On completing this 
work he was transferred to the Oak Ridge facility, where he aided engineers in devising safety procedures for 
material storage so that criticality accidents (for example, due to sub-critical amounts of fissile material inadvertently 
stored in proximity on opposite sides of a wall) could be avoided. He also did theoretical work and calculations on 
the proposed uranium-hydride bomb, which later proved not to be feasible. 

Feynman was sought out by physicist Niels Bohr for one-on-one discussions. He later discovered the reason: most 
physicists were too in awe of Bohr to argue with him. Feynman had no such inhibitions, vigorously pointing out 
anything he considered to be flawed in Bohr's thinking. Feynman said he felt as much respect for Bohr as anyone 
else, but once anyone got him talking about physics, he would become so focused he forgot about social niceties. 

Due to the top secret nature of the work, Los Alamos was isolated. In Feynman's own words, "There wasn't anything 
to do there". Bored, he indulged his curiosity by learning to pick the combination locks on cabinets and desks used to 
secure papers. Feynman played many jokes on colleagues. In one case he found the combination to a locked filing 
cabinet by trying the numbers a physicist would use (it proved to be 27-18-28 after the base of natural logarithms, e 
= 2.71828...), and found that the three filing cabinets where a colleague kept a set of atomic bomb research notes all 
had the same combination. He left a series of notes as a prank, which initially spooked his colleague, Frederic de 
Hoffman, into thinking a spy or saboteur had gained access to atomic bomb secrets. On several occasions, Feynman 
drove to Albuquerque to see his ailing wife in a car borrowed from Klaus Fuchs, who was later discovered to be a 
real spy for the Soviets, transporting nuclear secrets in his car to Santa Fe. 

On occasion, Feynman would find an isolated section of the mesa to drum in the style of American natives; "and 
maybe I would dance and chant, a little". These antics did not go unnoticed, and rumors spread about a mysterious 
Indian drummer called "Injun Joe". He also became a friend of laboratory head J. Robert Oppenheimer, who 
unsuccessfully tried to court him away from his other commitments after the war to work at the University of 
California, Berkeley. 

Feynman alludes to his thoughts on the justification for getting involved in the Manhattan project in The Pleasure of 
Finding Things Out. As mentioned earlier, he felt the possibility of Nazi Germany developing the bomb before the 
Allies was a compelling reason to help with its development for the US. However, he goes on to say that it was an 
error on his part not to reconsider the situation when Germany was defeated. In the same publication, Feynman also 
talks about his worries in the atomic bomb age, feeling for some considerable time that there was a high risk that the 
bomb would be used again soon so that it was pointless to build for the future. Later he describes this period as a 
"depression." 
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Early academic career 

Following the completion of his Ph.D. in 1942, Feynman held an appointment at the University of 

Wisconsin-Madison (UW) as an assistant professor of physics. The appointment was spent on leave for his 

involvement in the Manhattan project. In 1945, he received a letter from Dean Mark Ingraham of the College of 

Letters and Science requesting his return to UW to teach in the coming academic year. His appointment was not 

extended when he did not commit to return. In a talk given several years later at UW, Feynman quipped, "It's great to 

n 8i 

be back at the only university that ever had the good sense to fire me". 

After the war, Feynman declined an offer from the Institute for Advanced Study in Princeton, New Jersey, despite 

the presence there of such distinguished faculty members as Albert Einstein, Kurt Godel, and John von Neumann. 

Feynman followed Hans Bethe, instead, to Cornell University, where Feynman taught theoretical physics from 1945 
ri2i 

to 1950. During a temporary depression following the destruction of Hiroshima by the bomb produced by the 

Manhattan Project, he focused on complex physics problems, not for utility, but for self-satisfaction. One of these 

was analyzing the physics of a twirling, nutating dish as it is moving through the air. His work during this period, 

which used equations of rotation to express various spinning speeds, soon proved important to his Nobel 

Prize-winning work. Yet because he felt burned out, and had turned his attention to less immediately practical but 

ri2i 

more entertaining problems, he felt surprised by the offers of professorships from renowned universities. 

Despite yet another offer from the Institute for Advanced Study, which would have included teaching duties (which 
was not included in the Institute's initial offer, a factor in his rejection of it), Feynman opted for the California 
Institute of Technology (Caltech) — as he says in his book Surely You 're Joking Mr. Feynman! — because a desire 
to live in a mild climate had firmly fixed itself in his mind while installing tire chains on his car in the middle of a 
snowstorm in Ithaca. 



Feynman has been called the "Great Explainer" J He gained a 

reputation for taking great care when giving explanations to his 

students and for making it a moral duty to make the topic accessible. 

His guiding principle was that if a topic could not be explained in a 

freshman lecture, it was not yet fully understood. Feynman gained 

great pleasure^ from coming up with such a "freshman-level" 

explanation, for example, of the connection between spin and statistics. 

What he said was that groups of particles with spin 1/2 "repel", 

whereas groups with integer spin "clump". This was a brilliantly 

simplified way of demonstrating how Fermi-Dirac statistics and 

Bose-Einstein statistics evolved as a consequence of studying how 

fermions and bosons behave under a rotation of 360°. This was also a 

question he pondered in his more advanced lectures and to which he 

T211 

demonstrated the solution in the 1986 Dirac memorial lecture. In 
the same lecture, he further explained that antiparticles must exist, for 
if particles only had positive energies, they would not be restricted to a so-called "light cone". 

He opposed rote learning or unthinking memorization and other teaching methods that emphasized form over 
function. He put these opinions into action whenever he could, from a conference on education in Brazil to a State 
Commission on school textbook selection. Clear thinking and clear presentation were fundamental prerequisites for 
his attention. It could be perilous even to approach him when unprepared, and he did not forget the fools or 
pretenders P 2 ^ 

During one sabbatical year, he returned to Newton's Principia Mathematica to study it anew; what he learned from 
Newton, he passed along to his students, such as Newton's attempted explanation of diffraction. 



The Feynman Lectures 
on Physics 

Richard P. Feynman 




Feynman the "Great Explainer": The Feynman 

Lectures on Physics found an appreciative 
audience beyond the undergraduate community 
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Caltech years 

Feynman did significant work while at Caltech, including research in: 

• Quantum electrodynamics. The theory for which Feynman won his 
Nobel Prize is known for its accurate predictions.^ ^ This theory 
was developed in the earlier years during Feynman' s work at 
Cornell. He helped develop a functional integral formulation of 
quantum mechanics, in which every possible path from one state to 
the next is considered, the final path being a sum over the 
possibilities (also referred to as sum-over-paths or sum over 

histories) P 5 ^ 

The Feynman section at the Caltech bookstore 

• Physics of the superfluidity of supercooled liquid helium, where 

helium seems to display a complete lack of viscosity when flowing. Applying the Schrodinger equation to the 
question showed that the superfluid was displaying quantum mechanical behavior observable on a macroscopic 
scale. This helped with the problem of superconductivity; however, the solution eluded Feynman P 6 ^ It was 
solved with the BCS theory of superconductivity, proposed by John Bardeen, Leon Neil Cooper, and John Robert 
Schrieffer. 

• A model of weak decay, which showed that the current coupling in the process is a combination of vector and 
axial (an example of weak decay is the decay of a neutron into an electron, a proton, and an anti-neutrino). 
Although E. C. George Sudarshan and Robert Marshak developed the theory nearly simultaneously, Feynman's 
collaboration with Murray Gell-Mann was seen as seminal because the weak interaction was neatly described by 
the vector and axial currents. It thus combined the 1933 beta decay theory of Enrico Fermi with an explanation of 
parity violation. 

He also developed Feynman diagrams, a bookkeeping device which helps in conceptualizing and calculating 
interactions between particles in spacetime, notably the interactions between electrons and their antimatter 
counterparts, positrons. This device allowed him, and later others, to approach time reversibility and other 
fundamental processes. Feynman's mental picture for these diagrams started with the hard sphere approximation, 
and the interactions could be thought of as collisions at first. It was not until decades later that physicists thought of 
analyzing the nodes of the Feynman diagrams more closely. Feynman famously painted Feynman diagrams on the 
exterior of his van/ 27 * 

From his diagrams of a small number of particles interacting in spacetime, Feynman could then model all of physics 

T281 

in terms of those particles' spins and the range of coupling of the fundamental forces. Feynman attempted an 
explanation of the strong interactions governing nucleons scattering called the parton model. The parton model 
emerged as a complement to the quark model developed by his Caltech colleague Murray Gell-Mann. The 
relationship between the two models was murky; Gell-Mann referred to Feynman's partons derisively as "put-ons". 
In the mid 1960s, physicists believed that quarks were just a bookkeeping device for symmetry numbers, not real 
particles, as the statistics of the Omega-minus particle, if it were interpreted as three identical strange quarks bound 
together, seemed impossible if quarks were real. The Stanford linear accelerator deep inelastic scattering experiments 
of the late 1960s showed, analogously to Ernest Rutherford's experiment of scattering alpha particles on gold nuclei 
in 1911, that nucleons (protons and neutrons) contained point-like particles which scattered electrons. It was natural 
to identify these with quarks, but Feynman's parton model attempted to interpret the experimental data in a way 
which did not introduce additional hypotheses. For example, the data showed that some 45% of the energy 
momentum was carried by electrically neutral particles in the nucleon. These electrically neutral particles are now 
seen to be the gluons which carry the forces between the quarks and carry also the three-valued color quantum 
number which solves the Omega-minus problem. Feynman did not dispute the quark model; for example, when the 
fifth quark was discovered in 1977, Feynman immediately pointed out to his students that the discovery implied the 
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existence of a sixth quark, which was duly discovered in the decade after his death. 

After the success of quantum electrodynamics, Feynman turned to quantum gravity. By analogy with the photon, 
which has spin 1 , he investigated the consequences of a free massless spin 2 field, and was able to derive the Einstein 
field equation of general relativity, but little moreJ 29 ^ However, the computational device that Feynman discovered 
then for gravity, "ghosts", which are "particles" in the interior of his diagrams which have the "wrong" connection 
between spin and statistics, have proved invaluable in explaining the quantum particle behavior of the Yang-Mills 
theories, for example QCD and the electro- weak theory. 

In 1965, Feynman was appointed 

foreign member of the Royal 1963 H - P - WIGMER Physus M. QOEPPERT-MAY 

Society.™ At this time in the early 1964 C H J OWNES Ph V sics K. BLOCH Medic 

1()An 1; . , , . • „. , MARTIN LUTHER KING JR Peace 

1960s, Feynman exhausted himself by > . . > „ „- _ 

,. y ,., y 1965 i, SCHWINGER Physics R. P. FEYNMAN P 

working on multiple maior proiects at ' ■ < • „ _ - • 

. ,J R. B. WOODWARD Chemistry 

the same time, including a request, .v b ' D ai ic MpAinri 

J 966 R S MULLIKEN Chemistry P. ROUb Memcm 

while at Caltech, to "spruce up" the ■ffiH ^ ^^HHHHhI 

teaching of undergraduates. After three Mention of Feynman's prize on the monument at the American Museum of Natural 

years devoted to the task he produced History in New York City. Because the monument is dedicated to American Laureates, 
_ Tomonaga is not mentioned. 

a series or lectures that eventually 
became the Feynman Lectures on 

Physics, one reason that Feynman is still regarded as one of the greatest teachers of physics. He wanted a picture of a 
drumhead sprinkled with powder to show the modes of vibration at the beginning of the book. Outraged by many 
rock and roll and drug connections that could be made from the image, the publishers changed the cover to plain red, 
though they included a picture of him playing drums in the foreword. Feynman later won the Oersted Medal for 
teaching, of which he seemed especially proud. 

His students competed keenly for his attention; he was once awakened when a student solved a problem and dropped 
it in his mailbox; glimpsing the student sneaking across his lawn, he could not go back to sleep, and he read the 
student's solution. The next morning his breakfast was interrupted by another triumphant student, but Feynman 
informed him that he was too late. 

Partly as a way to bring publicity to progress in physics, Feynman offered $1000 prizes for two of his challenges in 
nanotechnology, claimed by William McLellan and Tom Newman, respectively. He was also one of the first 
scientists to conceive the possibility of quantum computers. 

Many of his lectures and other miscellaneous talks were turned into books, including The Character of Physical Law 
and QED: The Strange Theory of Light and Matter. He gave lectures which his students annotated into books, such 
as Statistical Mechanics and Lectures on Gravity. The Feynman Lectures on Physics occupied two physicists, 
Robert B. Leighton and Matthew Sands as part-time co-authors for several years. Even though they were not adopted 
by most universities as textbooks, the books continue to be bestsellers because they provide a deep understanding of 
physics. As of 2005, The Feynman Lectures on Physics has sold over 1.5 million copies in English, an estimated 1 
million copies in Russian, and an estimated half million copies in other languages. 

In 1974, Feynman delivered the Caltech commencement address on the topic of cargo cult science, which has the 
semblance of science but is only pseudoscience due to a lack of "a kind of scientific integrity, a principle of scientific 
thought that corresponds to a kind of utter honesty" on the part of the scientist. He instructed the graduating class that 
"The first principle is that you must not fool yourself — and you are the easiest person to fool. So you have to be very 
careful about that. After you've not fooled yourself, it's easy not to fool other scientists. You just have to be honest in 
a conventional way after that."^ 

In 1984-86, he developed a variational method for the approximate calculation of path integrals which has led to a 
powerful method of converting divergent perturbation expansions into convergent strong-coupling expansions 
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(variational perturbation theory) and, as a consequence, to the most accurate determination of critical exponents 
measured in satellite experiments.^ 36 ^ 

In the late 1980s, according to "Richard Feynman and the Connection Machine", Feynman played a crucial role in 
developing the first massively parallel computer, and in finding innovative uses for it in numerical computations, in 
building neural networks, as well as physical simulations using cellular automata (such as turbulent fluid flow), 
working with Stephen Wolfram at Caltech. His son Carl also played a role in the development of the original 
Connection Machine engineering; Feynman influencing the interconnects while his son worked on the software. 

Feynman diagrams are now fundamental for string theory and M-theory, and have even been extended topologically. 
The world-lines of the diagrams have developed to become tubes to allow better modeling of more complicated 
objects such as strings and membranes. However, shortly before his death, Feynman criticized string theory in an 
interview: "I don't like that they're not calculating anything," he said. "I don't like that they don't check their ideas. I 
don't like that for anything that disagrees with an experiment, they cook up an explanation — a fix-up to say, 'Well, it 
still might be true.'" These words have since been much-quoted by opponents of the string-theoretic direction for 
particle physics 

Challenger disaster 



Feynman played an important role on the Presidential Rogers 
Commission, which investigated the Challenger disaster. Feynman 
devoted the latter half of his book What Do You Care What Other 
People Think? to his experience on the Rogers Commission, straying 
from his usual convention of brief, light-hearted anecdotes to deliver 
an extended and sober narrative. Feynman' s account reveals a 
disconnect between NASA's engineers and executives that was far 
more striking than he expected. His interviews of NASA's 
high-ranking managers revealed startling misunderstandings of 




elementary concepts. He concluded that the NASA management's T he 1986 Space Shuttle Cte/te^r disaster, 
space shuttle reliability estimate was fantastically unrealistic. He 

warned in his appendix to the commission's report, "For a successful technology, reality must take precedence over 
public relations, for Nature cannot be fooled. "^ 



Personal life 

While researching for his Ph.D., Feynman married his first wife, Arline Greenbaum (often spelled Arlene). She was 
diagnosed with tuberculosis, but she and Feynman were careful, and he never contracted it. She succumbed to the 
disease in 1945. This portion of Feynman's life was portrayed in the 1996 film Infinity, which featured Feynman's 
daughter Michelle in a cameo role. 

He was married a second time in June 1952, to Mary Louise Bell of Neodesha, Kansas; this marriage was brief and 
unsuccessful. He later married Gweneth Howarth from Ripponden, Yorkshire, who shared his enthusiasm for life 
and spirited adventure J 40 ^ Besides their home in Altadena, California, they had a beach house in Baja California, 
purchased with the prize money from Feynman's Nobel Prize, his one third share of $55,000. They remained married 
until Feynman's death. They had a son, Carl, in 1962, and adopted a daughter, Michelle, in 1968 J 40 ^ 

Feynman had a great deal of success teaching Carl, using discussions about ants and Martians as a device for gaining 
perspective on problems and issues; he was surprised to learn that the same teaching devices were not useful with 
Michelle J 41 ^ Mathematics was a common interest for father and son; they both entered the computer field as 
consultants and were involved in advancing a new method of using multiple computers to solve complex 
problems — later known as parallel computing. The Jet Propulsion Laboratory retained Feynman as a computational 
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consultant during critical missions. One co-worker characterized Feynman as akin to Don Quixote at his desk, rather 
than at a computer workstation, ready to do battle with the windmills. 

Feynman traveled a great deal, notably to Brazil, and near the end of his life schemed to visit the Russian land of 
Tuva, a dream that, because of Cold War bureaucratic problems, never became reality. The day after he died, a 
letter arrived for him from the Soviet government giving him authorization to travel to Tuva. During this period, he 
discovered that he had a form of cancer, but, with surgery, managed to forestall it. Out of his enthusiastic interest in 
reaching Tuva came the phrase "Tuva or Bust" (also the title of a book about his efforts to get there), which was 
tossed about frequently amongst his circle of friends in hope that they, one day, could see it firsthand. The 
documentary movie Genghis Blues mentions some of his attempts to communicate with Tuva, and chronicles the 
successful journey there by his friends. 

Responding to Hubert Humphrey's congratulation for his Nobel Prize, Feynman admitted to a long admiration for the 
then vice president. t43] In a letter to an MIT professor dated December 6, 1966, Feynman expressed interest in 
running for the governor of California/ 44 ^ 

Feynman took up drawing at one time and enjoyed some success under the pseudonym "Ofey", culminating in an 
exhibition dedicated to his work. He learned to play a metal percussion instrument (frigideira) in a samba style in 
Brazil, and participated in a samba school. 

In addition, he had some degree of synesthesia for equations, explaining that the letters in certain mathematical 
functions appeared in color for him, even though invariably printed in standard black-and-white. t45] 

According to Genius, the James Gleick- authored biography, Feynman experimented with LSD during his 
ri4i 

professorship at Caltech. Somewhat embarrassed by his actions, Feynman largely sidestepped the issue when 

dictating his anecdotes; he mentions it in passing in the "O Americano, Outra Vez" section, while the "Altered 

States" chapter in Surely You're Joking, Mr. Feynman! describes only marijuana and ketamine experiences at John 

Lilly's famed sensory deprivation tanks, as a way of studying consciousness. Feynman gave up alcohol when he 

began to show early signs of alcoholism, as he did not want to do anything that could damage his brain — the same 

ri2i 

reason given in "O Americano, Outra Vez" for his reluctance to experiment with LSD. L 

In Surely You're Joking, Mr. Feynman!, he gives advice on the best way to pick up a girl in a hostess bar. At Caltech, 
he used a nude/topless bar as an office away from his usual office, making sketches or writing physics equations on 
paper placemats. When the county officials tried to close the place, all visitors except Feynman refused to testify in 
favor of the bar, fearing that their families or patrons would learn about their visits. Only Feynman accepted, and in 
court, he affirmed that the bar was a public need, stating that craftsmen, technicians, engineers, common workers 
"and a physics professor" frequented the establishment. While the bar lost the court case, it was allowed to remain 
open as a similar case was pending appeal. tl2] 

Feynman developed two rare forms of cancer, Liposarcoma and Waldenstrom's macroglobulinemia, dying shortly 

ri4i 

after a final attempt at surgery for the former on February 15, 1988, aged 69. His last recorded words are noted as 
"I'd hate to die twice. It's so boring. " tl4] t46] 



Popular legacy 

On May 4, 2005, the United States Postal Service issued the American Scientists commemorative set of four 37-cent 
self-adhesive stamps in several configurations. The scientists depicted were Richard Feynman, John von Neumann, 
Barbara McClintock and Josiah Willard Gibbs. Feynman's stamp, sepia-toned, features a photograph of a 
30-something Feynman and eight small Feynman diagrams. The stamps were designed by artist Victor Stabin under 
the direction of U.S. Postal Service art director Carl T. Herrman. 

The main building for the Computing Division at Fermilab is named the "Feynman Computing Center" in his 
honor. 1 ' 71 

Real Time Opera premiered its opera Feynman at the Norfolk (CT) Chamber Music Festival in June 2005 J 48] 
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In the television series Star Trek: The Next Generation, the shuttlecraft Feynman is named after him. 

On the 20th anniversary of Feynman's death, composer Edward Manukyan dedicated a piece for solo clarinet to his 
memory J 49] It was premiered by Doug Storey, the principal clarinetist of the Amarillo Symphony. 

In 2009 and 2010, respectively, clips of an interview with Feynman were used by composer John Boswell as part of 
the Symphony of Science project in his second science education video, We Are All Connected, and in his fifth 
installment called The Poetry of Reality. [50] 
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view?docId=kt5n39p6kO&chunk.id=dsc-1.7.7&query=FEYNMAN&brand=oac) 

• Feynman at The Caltech Institute Archives (http://archives.caltech.edu/search_catalog. 
cfm?search_field=Feynman) 

• The Feynman Lectures Website (http://www.feynmanlectures.info/) 

• About Richard Feynman (http://www.nobel-winners.com/Physics/richard_phillips_feynman.html) 

• Feynman's Scientific Publications (http://www.physik.fu-berlin.de/~kleinert/feynman/feynmanpub.htm) 

• A Biography of R. P. Feynman as a mathematician (http://www-groups.dcs.st-and.ac.uk/~history/ 
Mathematicians/Fey nman . html) 

• Gallery of Drawings by Richard P. Feynman (http://www.museumsyndicate.com/artist.php?artist=380) 

• Photos of Richard Feynman at the Emilio Segre Visual Archives, American Institute of Physics (http://photos. 
aip.org/veritySearchl. jsp?page=l&chapter=0&collection=storeTest&name=Feynman,+Richard+Phillips& 
descl=&search=SEARCH+ ) 

• The Pleasure of Finding Things Out (http://video.google.com/videoplay ?docid=7 136440703094429927): A 
49-minute BBC Horizon TV programme from the 1980s in which Feynman reminisces about his life and work. 

• Edge-video: Murray Gell-mann reminisces about Feynman's personal eccentricities (http://www.edge.org/ 
video/dsl/gell-mann.html) 

• Vega Trust Series videos Feynman gives his Auckland, NZ lectures that ended up the basis for his book, QED. 
(http://vega.org.Uk/video/subseries/8) 

• Biography and Bibliographic Resources (http://www.osti.gov/accomplishments/feynman.html), from the 
Office of Scientific and Technical Information, United States Department of Energy 

• Feynman discussing determining the rules of chess as an analogy for determining laws of physics. (http://www. 
youtube. com/watch? v=o 1 dgrvlWML4) 

• Feynman talks about confusion (http://www. youtube. com/watch? v=lytxafTXg6c) 

• Feynman's BBC interview on uncertainty (http://www. youtube. com/watch?v=_MmpUWEW6Is) 

• Feynman on disrespect of authority (http://www. youtube. com/watch?v=yhDOMxacnIE) 

• Works by Richard Feynman on Open Library at the Internet Archive 
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Murray Gell-Mann lecturing at TED in 2007 


Born 


September 15, 1929Manhattan, New York City, U.S. 


Residence 


United States 


Nationality 


American 


Fields 


Physics 


Institutions 


University of Southern California 
Santa Fe Institute 
California Institute of Technology 
University of New Mexico 


Alma mater 


Yale University, MIT 


Doctoral advisor 


Victor Weisskopf 


Doctoral students 


Kenneth G. Wilson 
Sidney Coleman 
Rod Crewther 
James Hartle 
Christopher T. Hill 
H. Jay Melosh 
Barton Zwiebach 
Kenneth Young 
Todd Brun [1] 


Known for 


Elementary particles 
Gell-Mann matrices 
Gell-Mann-Nishijima formula 
Gell-Mann-Okubo mass formula 
Effective complexity 


Notable awards 


Nobel Prize in Physics (1969) 



Murray Gell-Mann (born September 15, 1929, pronounced /'mAri: "gel "maen/) is an American physicist who 
received the 1969 Nobel Prize in physics for his work on the theory of elementary particles. He is currently the 
Presidential Professor of Physics and Medicine at the University of Southern California. 

He formulated the quark model of hadronic resonances, and identified the SU(3) flavor symmetry of the light quarks, 
extending isospin to include strangeness, which he also discovered. He discovered the V-A theory of chiral neutrinos 
in collaboration with Richard Feynman. He created current algebra in the 1960s as a way of extracting predictions 
from quark models when the fundamental theory was still murky, which led to model-independent sum rules 
confirmed by experiment. 



Murray Gell-Mann 



690 



Gell-Mann, along with Maurice Levy, discovered the sigma model of pions, which describes low energy pion 
interactions. Modifying the integer-charged quark model of Han and Nambu, Fritzsch and Gell-Mann were the first 
to write down the modern accepted theory of quantum chromodynamics although they did not anticipate asymptotic 
freedom. 

Gell-Mann is responsible for the see-saw theory of neutrino masses, that produces masses at the inverse-GUT scale 
in any theory with a right-handed neutrino, like the SO(10) model. He is also known to have played a large role in 
keeping string theory alive through the 1970s, supporting that line of research at a time when it was unpopular. 

Biography 

Born in lower Manhattan into a family of Jewish immigrants, his father was a Romanian Jew from Czernowitz, 
Austro-Hungarian Empire, 1 ^ ^ Gell-Mann quickly revealed himself as a child prodigy. Propelled by an intense 
boyhood curiosity and love for nature, he entered Yale as a member of Jonathan Edwards College at fifteen after 
graduating valedictorian from the Columbia Grammar & Preparatory School. 

Gell-Mann's work in the 1950s involved recently discovered cosmic ray particles that came to be called kaons and 
hyperons. Classifying these particles led him to propose a new quantum number called strangeness. Another of 
Gell-Mann's ideas is the Gell-Mann— Nishijima formula, which was, initially, a formula based on empirical results, 
but was later explained by the quark model. Gell-Mann and Abraham Pais were involved in explaining many 
puzzling aspects of the physics of these particles. 

In 1961, this led him (and Kazuhiko Nishijima) to introduce a classification of elementary particles called hadrons 
(also independently proposed by Yuval Ne'eman six months earlier, although Gell-Mann alone got the Nobel Prize). 
This scheme is now explained by the quark model. Gell-Mann's own name for the classification scheme was the 
Eightfold Way, because of the octets of particles in the classification. The term is a reference to the eightfold way of 
Buddhism. 

Gell-Mann, and, independently, George Zweig, went on, in 1964, to postulate the existence of quarks, the particles 
from which the hadrons are composed. The name was coined by Gell-Mann and is a reference to the novel 
Finnegans Wake, by James Joyce ("Three quarks for Muster Mark!" book 2, episode 4). Zweig had referred to the 
particles as "aces",^ but Gell-Mann's name caught on. Quarks were soon accepted as the underlying elementary 
objects in the study of the structure of hadrons. In 1972 he introduced with Harald Fritzsch the quantum number 
'color' and later, in a joint paper with Heinrich Leutwyler, the full theory of quantum chromodynamics (QCD) was 
released as the gauge theory of strong interactions (cf. references). The quark model is part of QCD, and has been 
robust enough to survive the discovery of other flavours of quarks. Gell-Mann and Richard Feynman, working 
together, and a rival group of George Sudarshan and Robert Marshak, were the first to discover the vector structure 
of the weak interaction in physics. This work followed the discovery of parity violation by Chien-Shiung Wu, as 
suggested by Chen Ning Yang and Tsung-Dao Lee. 

In the 1990s his interest turned to the emerging study of complexity, where he was closely associated with the Santa 
Fe Institute. He wrote a popular science book about these matters, The Quark and the Jaguar: Adventures in the 
Simple and the Complex. The title of the book is taken from a line of an Arthur Sze poem: "The world of the quark 
has everything to do with a jaguar circling in the night." Gell-Mann is also a collector of East Asian antiquities and a 
keen linguist. 

George Johnson wrote a biography of Gell-Mann, which is entitled Strange Beauty: Murray Gell-Mann and the 
Revolution in 20th-century Physics. 
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Timeline 

Gell-Mann earned a bachelor's degree in physics from Yale University in 1948, and a PhD in physics from MIT in 
1951. He was a postdoctoral research associate in 1951, and a visiting research professor at University of Illinois at 
Urbana-Champaign from 1952 to 1953. After serving as Visiting Associate Professor at Columbia University in 
1954-55, he became a professor at the University of Chicago before moving to the California Institute of 
Technology, where he taught from 1955 until 1993. He was awarded a Nobel Prize in physics in 1969 for his 
discovery of a system for classifying subatomic particles. 

He is currently the Robert Andrews Millikan Professor of Theoretical Physics Emeritus at Caltech as well as a 
University Professor in the Physics and Astronomy Department of the University of New Mexico in Albuquerque, 
New Mexico. He is a member of the editorial board of the Encyclopaedia Britannica. In 1984 Gell-Mann co-founded 
the Santa Fe Institute — a non-profit research institute in Santa Fe, New Mexico — to study complex systems and 
disseminate the notion of a separate interdisciplinary study of complexity theory. There he met and befriended 
Pulitzer Prize- winning novelist Cormac McCarthy J 6 ^ 

Personal life 

Gell-Mann married Marcia South wick in 1992, after the death of his first wife, J. Margaret Dow (d. 1981), whom he 
married in 1955. His children are Elizabeth Sarah Gell-Mann (b. 1956) and Nicholas Webster Gell-Mann (b. 1963); 
and he has a stepson, Nicholas South wick Levis (b. 1978). 

Awards 

• Dannie Heineman Prize (1959) 

• Erice Prize (1990) 

• Ernest O. Lawrence Award (1966) 

• Franklin Medal (1967) 

• John J. Carty Medal (1968) 

• Nobel Prize in Physics (1969) 

• Humanist of the Year (2005) 

• Albert Einstein Medal (2005) 

Awards and honors 

• Yale University — D.Sc (h.c), 1959 

• American Physical Society — Dannie Heineman Prize, 1959 

• Ernest O. Lawrence Award, 1966 

• University of Chicago — Sc. D. (h.c), 1967 

• Franklin Institute of Philadelphia — Franklin Medal, 1967 

• National Academy of Sciences — John J. Carty Medal, 1968 

• University of Illinois — Sc.D.(h.c), 1968 

• Wesleyan University — Sc.D.(h.c), 1968 

• Research Corporation Award, 1969 

• University of Turin, Italy — Honorary Doctorate, 1969 

• Nobel Prize in Physics, 1969 

• University of Utah — Sc.D.(h.c), 1970 

• Columbia University — Sc.D.(h.c), 1977 

• University of Cambridge, England — Sc. D. (h.c), 1980 

• United Nations Environment Programme Roll of Honor for Environmental Achievement (The Global 500), 1988 
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• World Federation of Scientists — Erice Prize, 1990 

• University of Oxford, England — D.Sc.(h.c), 1992 

• Southern Illinois University — Sc.D.(h.c), 1993 

• University of Florida — Sc.D.(h.c), Doctorate of Natural Resources, 1994 

• Southern Methodist University — Sc.D.(h.c), 1999 

• Telluride Tech Festival Award of Technology, Telluride, Colorado, 2002 

• American Humanist Association - Humanist of the Year, 2005 

Notes 

[1] T. A. Brun (2009). "Applications of the decoherence formalism" (http://en.scientificcommons.org/43071322). PhD Thesis. California 

Institute of Technology. . Retrieved 2010-10-01. 
[2] "Nobel Prize Winner Appointed Presidential Professor at USC" (http://uscnews.usc.edu/university/ 

nobel_prize_winner_appointed_presidential_professor_at_usc . html) . . 
[3] M. Gell-Mann (October 1997). "My Father" (http://www.webofstories.com/play/10555). Web of Stories. . Retrieved 2010-10-01. 
[4] J. Brockman (2003). "The Making of a Physicist: A talk with Murray Gell-Mann" (http://www.edge.org/3rd_culture/gell-mann03/ 

gell-mann_print.html). Edge.org. . Retrieved 2010-10-01. 
[5] G. Zweig (1980) [1964]. "An SU(3) model for strong interaction symmetry and its breaking II" (http://cdsweb.cern.ch/search. 

py?recid=570209&ln=en). In D. Lichtenberg and S. Rosen. Developments in the Quark Theory ofHadrons. 1. Hadronic Press, pp. 22-101. . 
[6] D. Kushner (27 December 2007). "Cormac McCarthy's Apocalypse" (http://74.220.215. 94/~davidkus/index.php?view=article& 

catid=35:articles&id=61:cormac-mccarthys-apocalypse-&format=pdf&option=com_content&Itemid=54). Rolling Stone. . Retrieved 

2010-10-01. 

References and further reading 

• Biography and Bibliographic Resources (http://www.osti.gov/accomplishments/gellmann.html), from the 
Office of Scientific and Technical Information, United States Department of Energy 

• Encyclopaedia Britannica's Biography of Murray Gell-Mann (http://www.britannica.com/eb/ 
article-9036327?query=murray gell man&ct=) 

• Fritzsch, Gell-Mann, Leutwyler "Advantages of the color octet gluon picture", Physics Letters B 47, 1973, p. 365; 
Fritzsch, Gell-Mann "Current algebra- quarks and what else?", 16. International Conference High energy physics, 
Cern 1972, vol.2, p. 135 

• Murray Gell-Mann Home page at Santa Fe Institute (http://www.santafe.edu/~mgm/) 

• Murray Gell-Mann (http://webofstories.com/gl/murray.gell-mann) video at Web of Stories (http:// 
webof stories . com) 

• Strange Beauty home page (http://talaya.net/strangebeauty.html) 

• TedTalks March 2007: Beauty and truth in physics (http://www.ted.com/index.php/talks/view/id/194) 

• The Making of a Physicist: A Talk With Murray Gell-Mann (http://www.edge.org/3rd_culture/gell-mann03/ 
gell-mann_print . html) 

• The Man Who Knows Everything (http://query.nytimes.com/gst/fullpage. 
html?res=9504E6DB1530F93BA35756C0A962958260), David Berreby, New York Times, May 8, 1994 

• The Man With Five Brains (http://www. webcitation.org/query ?url=http://www. geocities.com/ 
omegaman_uk/gellmann.html&date=2009-10-26+00:00:53) 

• The many worlds of Murray Gell-Mann (http://physicsweb.Org/articles/world/16/6/2/l) 

• The Simple and the Complex, Part I: The Quantum and the Quasi-Classical with Murray Gell-Mann, Ph.D. (http:/ 
/ww w. williamj ames . com/transcripts/gell 1 . htm) 

• Nobel Prize Biography (http://nobelprize.org/nobel_prizes/physics/laureates/1969/gell-mann-bio.html) 
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External links 

• Biography and Bibliographic Resources (http://www.osti.gov/accomplishments/gellmann.html), from the 
Department of Energy, Office of Scientific & Technical Information 

• Gell-Mann's Home Page at SFI (http://www.santafe.edu/~mgm/) 

• TED Talks: Murray Gell-Mann on beauty and truth in physics (http://www.ted.com/talks/view/id/194) at 
TED in 2007 

• TED Talks: Murray Gell-Mann on the ancestor of language (http://www.ted.com/talks/view/id/276) at TED 
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if 


Steven Weinberg at the 2010 Texas Book Festival. 


Born 


May 3, 1933New York City, New York, USA 


Residence 


United States 


Nationality 


United States 


Fields 


Physics 


Institutions 


University of California, Berkeley 
MIT 

Harvard University 
University of Texas at Austin 


Alma mater 


Cornell University 
Princeton University 


Doctoral advisor 


Sam Treiman 


Doctoral students 


Orlando Alvarez 
Claude Bernard 
Lay Nam Chang 
Bob Holdom 
Ubirajara van Kolck 
Rafael Lopez-Mobilia 
John Preskill 
Fernando Quevedo 
Mark G. Raizen 
Scott Willenbrock 


Known for 


Electromagnetism and Weak Force 
unification 

Weinberg-Witten theorem 


Influenced 


Alan Guth 


Notable awards 


Nobel Prize in Physics (1979) 


Notes 

He is married to the professor of law, Louise Weinberg. 



Steven Weinberg (born May 3, 1933) is an American physicist and Nobel laureate in Physics for his contributions 
with Abdus Salam and Sheldon Glashow to the unification of the weak force and electromagnetic interaction 
between elementary particles. 
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Biography 

Steven Weinberg was born in 1933 in New York City to Jewish immigrants Frederick and Eva Weinberg, but is an 
atheist^ . He graduated from Bronx High School of Science in 1950 and received his bachelor's degree from Cornell 
University in 1954, living at the Cornell branch of the Telluride Association. He left Cornell and went to the Niels 
Bohr Institute in Copenhagen where he started his graduate studies and research. After one year, Weinberg returned 
to Princeton University where he earned his Ph.D. degree in Physics in 1957, studying under Sam Treiman. 

Academic career 

After completing his Ph.D., Weinberg worked as a post-doctoral researcher at Columbia University (1957-1959) and 
University of California, Berkeley (1959) and then he was promoted to faculty at Berkeley (1960-1966). He did 
research in a variety of topics of particle physics, such as the high energy behavior of quantum field theory, 
symmetry breaking, pion scattering, infrared photons and quantum gravity. It was also during this time that he 
developed the approach to quantum field theory that is described in the first chapters of his book The Quantum 
Theory of Fields and started to write his textbook Gravitation and Cosmology. Both textbooks, perhaps especially 
the second, are among the most influential texts in the scientific community in their subjects. 

In 1966, Weinberg left Berkeley and accepted a lecturer position at Harvard. In 1967 he was a visiting professor at 
MIT. It was in that year at MIT that Weinberg proposed his model of unification of electromagnetism and of nuclear 
weak forces (such as those involved in beta-decay and kaon-decay)^ . This model is now known as the electro weak 
unification theory. An important feature of this model is the prediction of the existence of another interaction 
mechanism between leptons, known as neutral current and mediated by the Z boson. The 1973 experimental 
discovery of this Z boson was one verification of the electroweak unification. The paper by Weinberg in which he 
presented this theory was one of the highest cited theoretical works ever in high energy physics as of 2009 ^ . 

After his 1967 seminal work on the unification of weak and electromagnetic interactions, Steven Weinberg 
continued his work in many aspects of particle physics, quantum field theory, gravity, super symmetry, superstrings 
and cosmology, as well as a theory called Technicolor. 

In the years after 1967, the full Standard Model of elementary particle theory was developed through the work of 
many contributors. In it, the weak and electromagnetic interactions already unified by the work of Weinberg, Abdus 
Salam and Sheldon Glashow, are further unified with the strong interactions, in one overarching theory. One of its 
fundamental aspects was the prediction of the existence of the Higgs boson. In 1973 Weinberg proposed a 
modification of the Standard Model which did not contain that model's fundamental Higgs boson. 

Weinberg became Higgins Professor of Physics at Harvard University in 1973. 

It is of special importance that in 1979 he pioneered the modern view on the renormalization aspect of quantum field 
theory that considers all quantum field theories as effective field theories and changed completely the viewpoint of 
previous work (including his own) that a sensible quantum field theory must be renormalizable^ . This approach 
allowed the development of effective theory of quantum gravity , low energy QCD, heavy quark effective field 
theory and other developments, and it is a topic of considerable interest in current research. 

In 1979, after the experimental discovery of the neutral currents — i.e. the discovery of the inferred existence of the 
Z boson — , Steven Weinberg was awarded the Nobel Prize in Physics together with Abdus Salam and Sheldon 
Glashow for developing their theory of electroweak unification. 

In 1982 Weinberg moved to the University of Texas at Austin as the Jack S. Josey-Welch Foundation Regents Chair 
in Science and founded the Theory Group of the Physics Department. 

roi 

There is current (2008) interest in Weinberg's 1976 proposal of the existence of new strong interactions — a 
proposal dubbed "Technicolor" by Leonard Susskind — because of its chance of being observed in the LHC as an 
explanation of the hierarchy problem. 
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Steven Weinberg's influence and importance are confirmed by the fact that he is frequently among the top scientists 

rm 

with highest research effect indices, such as the h-index and the creativity index. 1 

Other intellectual legacy 

Besides his scientific research, Steven Weinberg has been a prominent public spokesman for science, testifying 
before Congress in support of the Superconducting Super Collider, writing articles for the New York Review of 
Books^ , and giving various lectures on the larger meaning of science. His books on science written for the public 
combine the typical scientific popularization with what is traditionally considered history and philosophy of science 
and atheism. 

Weinberg was a major participant in what is known as the Science Wars, standing with Paul R. Gross, Norman 
Levitt, Alan Sokal, Lewis Wolpert, and Richard Dawkins, on the side arguing for the hard realism of science and 
scientific knowledge and against the constructionism proposed by such social scientists as Stanley Aronowitz, Barry 
Barnes, David Bloor, David Edge, Harry Collins, Steve Fuller, and Bruno Latour. 

Weinberg is also known for his support of Israel. While this is not extraordinary in itself, he, like many American 
Jews, supports Israel from a liberal point of view. He wrote an essay titled "Zionism and Its Cultural Adversaries" to 
explain his views on the issue. 

Weinberg has canceled trips to universities in the United Kingdom because of British boycotts directed towards 
Israel. He has explained: 

"Given the history of the attacks on Israel and the oppressiveness and aggressiveness of other countries in the 
Middle East and elsewhere, boycotting Israel indicated a moral blindness for which it is hard to find any 
explanation other than antisemitism}^ 

His views on religion were expressed in a speech from 1999 in Washington, D.C.: 

"With or without religion, good people can behave well and bad people can do evil; but for good people to do 
ri2i 

evil — that takes religion. 
He has also said: 

ri3i 

"The more the universe seems comprehensible, the more it seems pointless. 
He attended and was a speaker at the Beyond Belief symposium in November 2006. 

Personal 

He is married to Louise Weinberg and has one daughter, Elizabeth. 

Honors and awards 

The honors and awards that Professor Weinberg received include: 

• Honorary Doctor of Science degrees from a dozen institutions: University of Chicago, Knox College, University 
of Rochester, Yale University, City University of New York, Dartmouth College, Weizmann Institute, Clark 
University, Washington College, Columbia University, Bates College. 

• American Academy of Arts and Sciences, elected 1968 

• National Academy of Sciences, elected 1972 

• J. R. Oppenheimer Prize, 1973 

• Dannie Heineman Prize for Mathematical Physics, 1977 

• Steel Foundation Science Writing Award, 1977, for authorship of The First Three Minutes (1977) 

• Elliott Cresson Medal (Franklin Institute), 1979 

• Nobel Prize in Physics, 1979 
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• Elected to American Philosophical Society, Royal Society of London (Foreign Honorary Member), Philosophical 
Society of Texas 

• James Madison Medal of Princeton University, 1991 

• National Medal of Science, 1991 

• Lewis Thomas Prize for Writing about Science, 1999. 

• 2002 Humanist of the Year, American Humanist Association 

• James Joyce Award, University College Dublin, 2009 

Selected publications 

Bibliography: books authored / coauthor ed 

• Gravitation and Cosmology: Principles and Applications of the General Theory of Relativity (1972) 

• The First Three Minutes: A Modern View of the Origin of the Universe (1977, updated with new afterword in 
1993, ISBN 0-465-02437-8) 

• The Discovery of Subatomic Particles (1983) 

• Elementary Particles and the Laws of Physics: The 1986 Dirac Memorial Lectures (1987; with Richard Feynman) 

• Dreams of a Final Theory: The Search for the Fundamental Laws of Nature (1993), ISBN 0-09-922391-0 

• The Quantum Theory of Fields (three volumes: 1995, 1996, 2003) 

• Facing Up: Science and Its Cultural Adversaries (2001, 2003, HUP) 

• Glory and Terror: The Coming Nuclear Danger (2004, NYRB) 

• Cosmology (2008, OUP) 

• Lake Views: This World and the Universe (2010), Belknap Press of Harvard University Press, ISBN 0674035151. 
Scholarly articles 

• Weinberg, S., A Model of Leptons [14] , Phys. Rev. Lett. 19, 1264-1266 (1967) - the electroweak unification 
paper. 

• Weinberg, S. & G. Feinberg. "Law of Conservation of Muons" ^ 15 ^, Columbia University, University of 
California-Berkeley, United States Department of Energy (through predecessor agency the Atomic Energy 
Commission), (Feb. 1961). 

• Pais, A., Weinberg, S., Quigg, C, Riordan, M., Panofsky, W.K.H. & V. Trimble. "100 years of elementary 
particles" ^ l6 \ Stanford Linear Accelerator Center United States Department of Energy, Beam Line, vol. 27, issue 
1, Spring 1997. (April 1,1997). 

Popular articles 

ri7i 

• A Designer Universe? , critically discussing the possibility of the intelligent design of the universe, is based on 
a talk given in April 1999 at the Conference on Cosmic Design of the American Association for the Advancement 
of Science in Washington, D.C. 

References and notes 

n 8i 

• His Nobel prize autobiography serves as a general reference to this article. 

[ 1 ] http://brainz.org/50-most-brilliant-atheists-all-time/ 

[2] A partial list of this work is: Weinberg, S. Phys. Rev. 118 838-849 (1960); Weinberg, S. Phys. Rev. 127 965-970 (1962); Weinberg, S. Phys. 

Rev. Lett. 17 616-621 (1966); Weinberg, S. Phys. Rev. 140 B516-B524 (1965). 
[3] Weinberg, S. Phys. Rev. 133, B1318-B1332 (1964); Weinberg, S. Phys. Rev. 134 B882-B896 (1964); Weinberg, S. Phys. Rev. 181 1893-1899 

(1969) 

[4] Weinberg, S. Phys. Rev.Lett. 19 1264-1266 (1967). 

[5] SPIRES: Top Cited Articles of All Time (2009 edition) (http://www.slac.stanford.edu/spires/topcites/2009/alltime.shtml) 
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[6] Weinberg, S. Physica 96A, 327 (1979) 

[7] Donoghue, J. F. Phys. Rev. D 50, 3874 (1994) 

[8] Weinberg, S. Phys. Rev. D13 974-996 (1976). 

[9] In 2006 Weinberg had the second highest creativity index among physicists http://physicsweb.Org/articles/news/10/8/13/l 
[10] His articles in the New York Review of Books (http://www.nybooks.com/authors/201) 

[11] "Nobel laureate cancels London trip due to anti-Semitism" (http://www.ynetnews.com/articles/0,7340,L-3404128,00.html). YNet News 

Jewish Daily. May 24, 2007. . Retrieved 2007-06-01. 
[12] Steven Weinberg. "A Designer Universe?" (http://www.physlink.com/Education/essay_weinberg.cfm). . Retrieved 2008-07-14. "A 

version of the original quote from address at the Conference on Cosmic Design, American Association for the Advancement of Science, 

Washington, D.C. in April 1999" 

[13] "What do you get if you divide science by God?" (http://news.bbc.co.Uk/2/hi/uk_news/magazine/7955846.stm). BBC News. March 

24, 2009. . Retrieved 2009-03-24. 
[14] http://cos.cumt.edu.cn/jpkc/dxwl/zl/zll/Physical%20Review%20Classics/particle/066.pdf 
[15] http://www.osti. gov/cgi-bin/rd_accomplishments/display_biblio.cgi?id=ACC0126&numPages=12&fp=N 
[16] http://www.osti. gov/cgi-bin/rd_accomplishments/display_biblio.cgi?id=ACC0054&numPages=55&fp=N 
[17] http :// www . phy slink. com/Education/ essay_weinberg . cfm 

[18] http://nobelprize.org/nobel_prizes/physics/laureates/1979/weinberg-autobio.html 

External links 

• Biography and Bibliographic Resources (http://www.osti.gov/accomplishments/weinberg.html), from the 
Office of Scientific and Technical Information, United States Department of Energy 

• Home Page of Steven Weinberg at University of Texas at Austin (http://www.ph.utexas.edu/~weintech/ 
weinberg.html) 

• Steven Weinberg on LHC (http://www.youtube.com/watch?v=Z14W3DYTIKw) 

• In CERN Courier, Steven Weinberg reflects on spontaneous symmetry breaking (http://cerncourier.com/cws/ 
article/cern/32522) 

• Oral history interview transcript with Steven Weinberg June 28, 1991, American Institute of Physics, Niels Bohr 
Library & Archives (http://www.aip.org/history/ohilist/5146.html) 

• Weinberg author page and archive (http://www.nybooks.com/authors/201) from The New York Review of 
Books 

• Publications (http://arxiv.org/find/hep-th/l/au:+Weinberg_S/0/l/0/all/0/l) on ArXiv 
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Anatole Abragam 


Born 


December 15, 1914 


Nationality 


French 


Fields 


physics 


Alma mater 


University of Paris 


Known for 


The Principles of Nuclear 
Magnetism 

nuclear magnetic resonance 


Notable awards 


Lorentz Medal in 1982 



Anatole Abragam (born December 15, 1914) is a French physicist who wrote The Principles of Nuclear Magnetism 
and has made significant contributions to the field of nuclear magnetic resonance/ 1 ^ Originally from Russia, 
Abragam and his family emigrated to France in 1925. 

After being educated at the University of Paris, (1933-1936), he served in the Second World War. After the war, he 

resumed his studies at the Ecole Superieure d'Electricite and subsequently obtained his Ph.D. from Oxford 

University in 1950 under the supervision of Maurice Pryce. In 1976, he was made an Honorary Fellow of both 

T21 

Merton, Magdalen, and Jesus Colleges, Oxford. 

From 1960 to 1985, he worked as a professor at the College de France. He was awarded the Lorentz Medal in 
1982. 

Books 

• Abragam, Anatole (1961). The Principles of Nuclear Magnetism. Clarendon Press, p. 599. OCLC 242700. 

• Abragam A & Bleaney B. Electron paramagnetic resonance of trans itionions . Oxford, England: Oxford 
University Press, 1970.^ 

• Abragam, Anatole (1989). Time Reversal, an autobiography. Oxford University Press. OCLC 18989324. Worth 
reading for insight into science, scientific politics, and the human condition — similar in this respect to molecular 
biologist Francois Jacob's autobiography The Statue Within (Basic Books, 1988). 
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§tefan Procopiu 


Born 


January 18, 1890Barlad 


Died 


February 22, 1972Ia§i 


Residence 


Ia§i 


Citizenship 


Romanian 


Nationality 


Romanian 


Fields 


Physics 


Alma mater 


Alexandru loan Cuza University of Ia§i 


Known for 


Bohr-Procopiu magneton 
Procopiu effect 
Procopiu phenomenon 


Notable awards 


Romanian State Prize (1964) 



§tefan Procopiu (b. January 19, 1890 in Barlad, Romania, d, August 22, 1972 in Ia§i, Romania) was a Romanian 
physicist. 



Biography 

§tefan Procopiu was born on January 19, 1890 in Barlad. His father, Emanoil Procopiu, was employed at the Barlad 

courthouse. His mother, Ecaterina Ta§ca was the daughter of Gheorghe I. Ta§ca (see Ta§ca family) ^ He attended 

the Gheorghe Ro§ca Codreanu High School in Barlad from 1901 to 1908, continuing his studies at the Faculty of 

Sciences of the "Alexandru loan" Cuza University of Ia§i from 1908 to 1912. After graduation he became assistant to 
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professor Dragomir Hurmuzescu. 

In 1919 he obtained a scholarship to continue his studies in Paris, attending courses of famous scientists, such as 
Gabriel Lippmann, Marie Curie, Paul Langevin, Aime Cotton. On 5 March 1924, Procopiu obtained the title of 
doctor in physics with the thesis "On the electric birefringence of suspensions" presented to a commission including 
professor Aime Cotton as coordinator and Charles Fabry and Henri Mouton as cross-examiners. 

After his return to Romania on January 15, 1925 professor of the gravitation, heat and electricity department of the 
"Alexandru loan Cuza" University of Ia§i, replacing his former teacher Dragomir Hurmuzescu, who had retired., 
Procopiu coordinated the department until his retirement in 1962.^ At the same time he was appointed professor at 
the "Gheorghe Asachi" Polytechnic Institute of Ia§i In 1939 §tefan Procopiu published his treatise on "Electricity 
and Magnetism", followed in 1948 by his monography on "Thermodynamics". 
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On June, 1948 he was appointed corresponding member of the Romanian Academy, being promoted to full 
membership on July 2, 1955. ^ In 1964 he was awarded the Romanian State Prize 1 ^ He was also decorated with the 
Order of Work (Ordinul Muncii), Order of the Star of Romania and the Order of Scientific Merit. Procopiu was also 
selected twice as member in the Commission for the award of the Nobel Prize, 

§tefan Procopiu was also deeply involved in the cultural life of the city of Ia§i. He was an active member of the 
Board of Directors of the National Theatre "Vasile Alecsandri" of Ia§i ^ 

§tefan Procopiu died on August 22, 1972 in Ia§i age 82. ^ 



Scientific activity 

§tefan Procopiu started scientific research even before graduating. He continued this activity while he was assistant 
professor. 

The magnetic moment of electrons 

The first important paper by §tefan Procopiu is "Determining the Molecular Magnetic Moment by M. Planck's 
Quantum Theory". After studying Planck's quantum theory and Langevin's magnetism theory, established the 
magnetic moment and determined the physical constant of magnetic moment, named magneton. ^ §tefan Procopiu 
published his results two years before Niels Bohr made the same discovery independently. The magneton is now 
known as Bohr-Procopiu magneton. 

Continuing his studies, in 1954 he established a method for the experimental determination of the magneton, which 
rsi 

he improved in 1963 . 

Other research before and during World War I 

§tefan Procopiu also worked on wireless communications and in 1913 published a paper on "Experimental Research 
on Wireless Telegraphy". In 1916 he invented a device for locating and establishing the depth of bullets in the bodies 
of the wounded soldiers. 



Longitudinal depolarization of light 

In 1921, Procopiu discovered and analyzed in the Physics Laboratory of Sorbonne University a new optical 
phenomenon which consisted in the longitudinal depolarization of light by suspensions and colloids. ^ . In 1930, the 
occurrence was designated as "Procopiu Phenomenon" by prof. Augustin Boutaric. Part of this research was included 
in Procopiu's doctoral thesis. 

Electromotive force of galvanic elements 

Thus, in 1930, studying the Barkhausen effect, §tefan Procopiu discovered a circular effect of magnetic 
discontinuity. In 1951, this effect was named "Procopiu Effect". ^ This discovery had important applications in the 
development of the memory of computers 

Studies of the earth magnetism 

Earth's magnetism was a continuous concern of §tefan Procopiu, For 25 years he studied this phenomenon in 
Romania and developed the magnetic maps of the country. He also identified the magnetic anomaly located on the 
Ia§i-Boto§ani line. 

In 1947, Procopiu identified a variation of the earth's magnetic field, with a periodicity of approximately 500 years, 
indicating that, starting 1932 earth's magnetic moment increases from the Ecuator to the poles. 
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Main works 

rm 

• Library of Congress L J 
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Ionel Solomon 

Ionel Solomon, Member of the French Academy of Sciences since 20 June 1988, Born 8 January 1929 Ia§i, 
Romania^ 

Citizenship: French 

Nationality: French and Romanian 

Fields', physicist 

Alma mater. Paris-Sorbonne University, in France 

Notable awards 1958 Grand Prix for Research (with Anatole Abragam and J. Combrisson); 1963 The CNRS Silver 
Medal; 1969 Robin Prize of the French Physics Society (Societe Francaise de Physique) Ionel Solomon (born 1929) 
is a French-Romanian physicist, Member of the French Academy of Sciences,CNRS Research Director, and 
Professor at thePolytechnic School in Paris. 

Education 

D 1949-1951 PhD, Polytechnic School (Ecole Polytechnique) in Paris 



Major scientific contributions 
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Ionel Solomon made major contributions to the fields of:Nuclear Magnetic Resonance (NMR)[1] 
doi:10.1103/PhysRev.99.559 [3] , Solid state physics [4] , Semiconductors[2] [5] [6] , and Photovoltaics[3] [7] [8] . In 
Nuclear Magnetic Resonance he derived fundamental equations that bear his name, and specify the nuclear spin-echo 
response and dipole-dipole interactions in solids (the Solomon equations) [4] [5]. Scientific career D 1951-1952 
Research Fellow at the University of Liverpool, UK D 1955-1956 Research Fellow at Harvard University, USA D 
1953-1962 Researcher in the resonance Group of the Atomic Energy Commission (Commissariat Energie Atomique 
in Saclay) D 1962 Director of the Laboratory for Condensed Matter (Solid-State) Physics (Laboratoire de Physique de 
la Matiere Condensee), at the Polytechnic School in Paris D 1962 Head of Research at C.N.R.S. D 1962 Head of 
Conferences D 1968 CNRS Research Director D 1973-1976 Physics Departement Head at the Polytechnic School in 
Paris 1973-1974 President of the Societe Francaise de Physique (the French Physics Society). D 1975-1979 Professor, 
at the Polytechnic School in Paris D 1976 Invited Visiting Professor at the Xerox Research Center, Palo Alto,USA D 
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1980 Invited Visiting Professor at Tokyo University D 1981-1985 Founder and Scientific Director of SOLEMS 
Company D 1987 President of the Scientific Council of PHOTOTRONICS (a French-German Company for 
photovoltaic products) D 1988, June 22, Elected Member of the Physics Institute of the French Academy of 
Sciences [6] D Laboratory of Condensed Matter Physics 

Awards and prizes 

D 1958 Grand Prix for Research (with Anatole Abragam and J. Combrisson) D 1963 The CNRS Silver Medal D 1969 
Robin Prize of the French Physics Society (Societe Francaise de Physique) D 1972 Holweck Prize of the Institute of 
Physics and S.F.P (French Physics Society) D 1981 The Y. Peyches Prize of the Academy of Sciences 

Notes 

[ 1 ] http://www . enotes . com/topic/Ionel_Solomon 

[2] I. Solomon. Relaxation Processes in a system of Two spins. Phys. Rev. 99, 559 (1955) 
[3] http://link.aps.org/doi/10. 1 103/PhysRev.99.559 

[4] http://prola.aps.org/abstract/PR/v99/i2/p559_l Phys. Rev. 99, 559-565 (1955) Relaxation Processes in a System of Two Spins 
[5] I. Solomon. "Amorphous Semiconductors", In "Topics in Applied Physics", Ed. Springer Verlag, Berlin (1979). 

[6] I. Solomon, M.P. Schmidt, H. Tran Quoc. Selective low-power plasma decomposition of silane-methane mixtures for the preparation of 

methylated amorphous silicon. Phys. Rev. B ,38: 9895 (1988) 
[7] I. Solomon, B. Drevillon, H. Shirai, N. Layadi. Plasma deposition of microcrystalline silicon: the selective etching model., J. Non-crystalline 

Solids, 164-166, p. 989 (1993) 

[8] K. Rerbal, F. Jomard, J.N. Chazalviel, F. Ozanam, I. Solomon. Visible luminescence of porous amorphous Si(l-x) Cx:H due to selective 
dissolution of silicon. Appl. Phys. Lett. 83, p. 45 (2003); K. Kerbral, J.N. Chazalviel, F. Ozanam, I. Solomon. Temperature dependence of 
photoluminescence in amorphous Sil-xCx:H films. Eur. Phys. J., B51, p. 61 (2006 
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